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Title of Invention 

infectious cDNA clone of GB Virus B and Uses Thereof 

Field of Invention 

^ The present invention relates to nucleic acid 

sequence which comprises the genome of an infectious GB 
virus B (GBV-B) clone. The invention also relates to 
the use of the nucleic acid sequence of the infectious 
GB virus B clone to study indirectly the molecular 
JO properties of hepatitis C virus (HCV) , and in the 

production of HCV/GBV-B chimeras. The invention further 
relates to the use of the infectious nucleic acid 
sequence of the GB virus B clone and the HCV/GBV-B 
chimeras in the development of vaccines and therapeutics 
for HCV. 

Baglcaround of Invention 

Transmission studies of potential human 
hepatitis agents were first reported in 1967 (Deinhardt 
1967) . Four tamarins inoculated with acute phase sera 
from a surgeon with acute hepatitis (patient GB) 
developed hepatitis, as did most tamarins inoculated in 
serial passage studies. Subsequent studies indicated 

25 that the etiological agent responsible for the 

development of hepatitis in these animals was not any of 
the known human hepatitis viruses (Purcell 1994) . In 
1995, two related RNA viruses named GB virus-B (GBV-B) 
and GB virus A (GBV-A) were identified in acute phase 
sera of a tamarin which developed hepatitis following 
inoculation with serum of the eleventh tamarin passage 
of the putative GB agent (Simons 1995a) . 

GBV-B infection of tamarins resulted in acute 

35 resolving hepatitis (Schlauder 1995, Buhk 1997) . The 
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natural host of GBV-B is still unknown as the virus has 
not been detected in uninoculated animals or in humans. 

GBV-A, on the other hand, is an indigenous 
tamarin virus rather than a component of the original GB 
inoculum (Bukh 1997, Erker 1998) . Experimental 
infection of tamarins with GBV-A did not produce 
hepatitis (Schlauder 1995). A human agent, GBV-C or 
hepatitis G virus, most closely related to GBV-A, was 
later identified (Simons 1995b, Linnen 1996) . However, 
it is still not clear whether this virus actually causes 
hepatitis (Alter 1998, Bukh 1998a). Thus, of the known 
GB viruses, GBV-B may be the only true hepatitis virus. 

Based on analysis of their genomic seq[uences, 
GBV-A, GBV-B and GBV-C were classified as members of the 
Flaviviridae family of viruses, and among the known 
viruses, GBV-B is the virus most closely related to 
hepatitis C virus (HCV) (Muerhoff 1995, Robertson 1998).. 

The GBV-B virus contains a positive-sense, 
single -stranded RNA genome of 9143 nucleotides (nts) 
(Simons 1995a, Muerhoff 1995) . The viral genome of GBV- 
B consists of a 5' untranslated region (UTR) , a single 
long open reading frame (ORP) and a 3' UTR. Based on 
known motifs, structural proteins were predicted to be 
encoded in the 5' portion of the ORF and nonstructural 
(NS) proteins in the 3' portion of the ORF (Muerhoff 
1995) . The hydropathy plots of the polyproteins of GBV- 
B and HCV are very similar even though the overall 
homology of the predicted polyproteins between GBV-B and 
HCV is only about 25-30% (Muerhoff 1995) . The putative 
envelope proteins (El and E2) of GBV-B and HCV share 
common structural features, and significant homology was 
observed between the NS3 serine protease, the NS3 RNA 
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helicase, and the NS5 RNA-dependent RNA polymerase 
regions of GBV-B and HCV (Muerhoff 1995) . Furthermore, 
the function and substrate specificity of the GBV-B and 
HCV NS3 serine proteases are also similar (Scarselli 
5 1997) . The genomic structure and organization of GBV-B 
and HCV share additional features of interest. First, 
colinear regions with significant sequence homology were 
identified in the 5' UTRs (Muerhoff 1995) and the 
predicted IRES structure of GBV-B is similar to that of 

'® HCV (Lemon 1997). Second, both viruses begin the 3' UTR 
with a short sequence followed by a poly (U) stretch 
followed by additional nucleotides (50 nucleotides for 
GBV-B and 98 nucleotides for HCV). However, the 3' 

J3 terminal sequence of HCV forms a stable stem- loop 

structure (Kolykhalov 1996) whereas the published 3' 
terminal sequence of GBV-B does not . 

To date, molecular studies of HCV are severely 
limited by the lack of an efficient cell culture system 

20 

for the virus and by expense and limited availability of 
chimpanzees, the sole animal model for HCV, 
Accordingly, a less expensive and more readily available 
animal than chimpanzees is necessary as an animal model 
25 for the study of HCV. 

Summary of Invention 

The present invention relates to nucleic acid 
sequence which comprises the genome of an infectious GB 
30 virus B (GBV-B) clone. It is therefore an object of the 
invention to provide nucleic acid sequence which encodes 
an infectious GBV-B. Such nucleic acid sequence is 
referred to throughout the application as "infectious 
nucleic acid sequence". 
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As significant structural homology exists 
between the genomes of GBV-B and HCV, the invention also 
relates to the use of infection of tamarins with the 
infectious nucleic acid sequence of GBV-B or with 
mutants of the infectious sequence to study indirectly 
the molecular properties of hepatitis C virus (HCV) or 
as a preliminary screen to identify agents which have 
antiviral activity against HCV. 

The invention further relates to "chimeric 
nucleic acid sequences" consisting of portions of the 
infectious nucleic acid sequence of GBV-B and portions 
of the nucleic acid sequences of other viruses closely 
related to GBV-B such as HCV, GBV-C or other members of 
the Flavivxridae family which do not replicate in 
tamarins. In a preferred embodiment, the chimeric 
nucleic acid sequences consist of portions of the 
infectious nucleic acid sequence of GBV-B and portions 
of the nucleic acid sequence of HCV. The nucleic acid 
sequences taken from GBV-B and HCV can be open- reading 
frame sequences, and/or sequences from the 5'UTR and/or 
3'UTR. 

In one embodiment, GBV-B/HCV chimeras may be 
made in which 5' or 3' UTR sequences of the GBV-B 
infectious clone are replaced with the corresponding 
sequence from an HCV clone. 

In another embodiment, GBV-B/HCV chimeras may 
be constructed in which the structural or non- structural 
regions of GBV-B are replaced by corresponding regions 
of HCV. Thus, such a chimera would contain, for 
example, the HCV structual region in a GBV-B "genomic 
backbone". Of course, it is understood by one of skill 
35 in the art that the construction of the above -described 
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chimeric nucleic acid sequences may be reversed such 
that, for example, the GBV structural region may replace 
the structual region of an HCV genome to produce a 
chimera in which the GBV structural region is contained 
J in an HCV backbone. 

The invention further relates to the use of 
the chimeric nucleic acid sequences of the invention to 
study the functions of HCV genes, and for the 
development of vaccine and antiviral agents against HCV. 

The invention also relates to the use of the 
infectious GBV-B nucleic acid sequence, the mutated GBV- 
B nucleic acid sequences or the chimeric sequences of 
the invention to identify cell lines capable of 
supporting the replication of GBV-B or the chimeras of 
the invention. 

The present invention also relates to the 
polypeptides encoded by the nucleic acid sequences of 
the invention or fragments thereof. 

The present invention further relates to the 
in vitro and in vivo production of GBV-B, mutant GBV-B 
viruses or chimeric GBV-B/HCV viruses from the nucleic 
acid sequences of the invention. 
25 The invention also provides that the nucleic 

acid sequences and viruses of the invention be supplied 
in the form of a kit, alone or in the form of a 
pharmaceut i cal composi t ion . 

30 Brief Description Of Figures 

Figure 1 shows a flow diagram of GB virus 
transmission studies in two species of tamarins, 
Saguinus wystax (SM) and Saguinus oedipus (SO) . The 
35 animals infected with GBV-B (Simons 1995a) are boxed. 
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Two serum pools (GB 8/93 and GB 2/94) were made from 
acutely infected animals. Both pools contained GBV-B, 
as well as GBV-A (Simons 1995) at a titer of 10° genome 
equivalent {GE)/ml. A 10% liver homogenate (CT 11/91) 
was made from a sacrificed tamarin. A number of S. 
mystax tamarins (SM 737, 749, 750, 760, 782, 795 and 
799) and S. oedipus tamarins (SO 100) were naturally 
infected with GBV-Asm and GBV-Aso# respectively, prior to 
inoculation (Bukh 1997) . Only two tamarins (SM 720 and 
748) , both GBV-AsM negative, became infected with GBV-A 
(Simons 1995) following inoculation. Tamarins SM4 2 and 
SM670 were not tested for GBV-A or GBV-Asm* 

Figure 2 shows the course of GBV-B infection in 
tamarins (S. mystax) inoculated with a dilution series of 
the GB 2/94 pool. All animals were inoculated 
intravenously at week 0 with 1 ml of the indicated 
dilution. Results of qualitative RT-nested PGR for GBV-B 
in serum are shown at the top (filled circles, positive;* 
empty circles, negative) . Serum levels of isocitrate 
dehydrogenase (ICD in units/ml) ; shaded area) and the 
estimated logio GBV-B GE titer (vertical columns) were 
plotted against time. 

Figure 3 shows alignment of the 3* UTR 
sequences of GBV-B. The sequence of the infectious clone 
of GBV-B (pGBB) is shown at the top (nts. 9038-9399) . 
The other sequences shown are: pGBB5-l, a non- infectious 
clone of GBV-B; GBV-B, a prototype of GBV-B (Simons 
1995) ; eleven "gb" clones obtained from CT 11/91 liver 
homogenate by 5* RACE on the minus-strand GBV-B RNA; four 
"29" clones obtained from GB 2/94 pool by RT-PCR across 
5 • -to-3 • -end-ligated viral GBV-B RNA; and seven "GBB3" 
clones obtained from GB 2/94 pool by standard RT-PCR. 
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With pGBB as the reference, nucleotide substitutions or 
insertions are shovm as uppercase letters, identical 
nucleotides are shown as dots and nucleotide deletions 
are shown as dashes. 

Figure 4 shows the predicted secondary 
structure of the 3' UTRs of GBV-B and HCV as determined 
by the program "mfold" (Genetics Computer Group) . 

Figure 5 shows the course of GBV-B infection in 
S. mystax tamarins transfected with RNA transcripts of 
pGBB. Both animals were negative for GBV-Asm- At week 0 
transcription mixtures were injected into tamarins by 
percutaneous intrahepatic injection guided by ultrasound. 
Results of qualitative RT-nested PGR for GBV-B in serum 
is shown at the top (filled circles, positive; empty 
circles, negative) . Serum levels of isocitrate 
dehydrogenase (ICD in units/ml; shaded area) and the 
estimated log^o GBV-B GE titer (vertical columns) were 
plotted against time. 

Figures 6A-6F show the nucleotide sequence of 
the infectious hepatitis C virus clone of genotype la 
strain H77C and Figures 6G-6H show the amino acid 
seq[uence encoded by the clone . 

Figures 7A-7F show the nucleotide sequence of 
the infectious hepatitis C virus clone of genotype lb 
strain HC-J4 and Figures 7G-H show the amino acid 
sequence encoded by the clone. 

Description of The Invention 

The present invention relates to nucleic acid 
sequence which comprises the genome of an infectious GB 
virus B (GBV-B) clone. The nucleic acid sequence which 
comprises the genome of an infectious GBV-B virus is 
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shown in SEQ ID N0:1 and is contained in the plasmid 
construct pGBB deposited with the American Type Culture 
Collection (ATCC) on May 28, 1999 and having ATCC 
accession number PTA-152. The present invention relates 
5 to the identification of a 260 nucleotide sequence at 

the 3' end of the infectious GBV-B clone which is shown 
in Example 3 to be necessary for the development of the 
infectious clone. 

Since GBV-B is the virus most closely related 
to HCV, the present invention also relates to 
experimental infection of tamarins with the infectious 
GBV-B clone of the invention or with mutants of the 
infectious GBV clone to study indirectly the molecular 
15 properties of HCV or as a preliminary screen to identify 
agents which have antiviral activity against HCV. For 
example, since the predicted internal ribosome entry 
site (IRES) structure in the 5'UTR of GBV-B is similar 
to that of HCV (Lemon 1997) , the NS3 serine proteases of 

20 

GBV-B and HCV have been shown to share substrate 
specificity in vitro (Scarselli 1997), and the 3'UTRs of 
HCV (Yanagi 1999) and GBV-B (see Examples) have been 
shown to be critical for viral infectivity, mutagenesis 
25 of these regions in the GBV-B infectious clone may be 

undertaken to examine IRES function, NS3 serine protease 
activity or the role of the 3'UTR in viral infectivity 
in vivo . Where such "mutations" are introduced into the 
GBV-B clone of the invention to create a **mutated" GBV-B 

30 

sequence, the mutations include, but are not limited to, 
point mutations, deletions and insertions. Of course, 
one of ordinary skill in the art would recognize that 
the size of the insertions would be limited by the 
35 ability of the resultant nucleic acid sequence to be 
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** properly packaged within the virion. Such mutations 

could be produced by techniques known to those of skill 
in the art such as site-directed mutagenesis, fusion 
PGR, and restriction digestion followed by religation. 
J Alternatively, given the significant 

structural homology that exists between the genomes of 
GBV and HCV, the infectious GBV-B clone may be used to 
screen for inhibitors of IRES function or viral enzyme 
activity (for example, NS3 helicase, NS3 protease, NS2- 
NS3 protease or NS5B RNA polymerase activity) . Such 
inhibitors may be useful as antiviral agents to HCV 
since viral enzyme activity and IRES function are known 
to be critical for HCV replication. 
15 The effect of such inhibitors on the IRES 

function or viral activity of the GBV-B encoded by the 
infectious sequence of the invention may be measured by 
assays known to those of skill in the art to measure 
directly or indirectly viral replication or viral 

20 

pathogenicity. Such assays include, but are not limited 
to, the measurement of virus titer in serum or liver of 
an infected tamarin by PGR or the measurement of GBV-B 
viral protein expression in liver cells of an infected 
25 tamarin by immunof lourescence or Western blot . Of 

course, it is understood that a comparison of results 
obtained for control tamarins (treated only with 
infectious nucleic acid sequence) with those obtained 
for treated tamarins (nucleic acid sequence and 

30 

antiviral agent) would indicate, the degree, if any, of 
antiviral activity of the candidate antiviral agent. Of 
course, one of ordinary skill in the art would readily 
understand that the tamarins can be treated with the 
35 candidate antiviral agent either before or after 
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^ exposure to the infectious nucleic acid sequence of the 
present invent ion . 

In yet another embodiment, the invention 
relates to "chimeric nucleic acid sequences" which 

5 consist of portions of the infectious nucleic acid 

sequence of GBV-B and portions of nucleic acid sequences 
of viruses which are related to GBV-B such as HCV, GBV-C 
and other members of the Flaviviridae family which do 
not infect tamarins. In a preferred embodiment, 
chimeric nucleic acid sequences consist of portions of 
the infectious nucleic acid sequence of GBV-B and 
portions of nucleic acid sequences of hepatitis C 
viruses (HCV) of various genotypes or subtypes; 
15 preferably portions of nucleic acid sequence of 

infectious HCV clones of genotypes la (ATCC accession 
number PTA-157; Figures 6A-6F) , lb {ATCC accession 
number 209596; Figures 7A-7F) or 2a (ATCC accession 
number PTA-153; SEQ ID NO: 4). The nucleic acid 

20 

sequences taken from GBV-B and HCV can be open- reading 
frame sequences, and/or sequences from the 5'UTR and/or 
3'UTR. The gene borders of the HCV genome, including 
nucleotide and amino acid locations, have been 
25 determined, for example, as depicted in Houghton, M. 

(1996), and the putative gene borders of the GBV-B are 
shown in Table 1. 

Of course, it is understood that the 
production of GBV-B/HCV chimeras could include insertion 

30 

of specific genes or regions of the infectious GBV-B 
clone into an HCV "genomic backbone" (where the HCV 
genomic backbone is preferably an infectious nucleic 
acid sequence of HCV genotypes la, lb or 2a described 
35 above) or alternatively, could include insertion of 
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specific genes (or portions thereof) or regions of an 
HCV genome into the GBV-B infectious clone of the 
invention. Of course, where HCV genes or regions are to 
be inserted into the GBV-B infectious clone, it is to be 
understood that the inserted HCV sequences may be 
unmodified or may be mutated in order to examine the 
effect of the mutation (s) on the function of the 
inserted HCV gene or region in the chimeric GBV-B-HCV 
virus . 

Such chimeras can readily be produced by 
methods known to those of ordinary skill in the art. 

In one embodiment, GBV-B/HCV chimeras may be 
made in which 5' or 3' UTR sequences of the GBV-B 
15 infectious clone are replaced with the corresponding 

sequence from an HCV clone. For example, chimeras may 
be constructed in which the IRES sequence of the 
infectious GBV-B clone is replaced by the IRES sequence 
of HCV. Such chimeras can be used in identifying 
inhibitors of IRES activity which would be useful as 
antiviral agents, or could be used to examine HCV IRES 
function in vivo . Alternatively, mutations could be 
introduced into the HCV IRES contained in the GBV-B 
25 clone in order to examine the effect of the mutation (s) 
on IRES function in vivo . 

Alternatively, GBV-B/HCV chimeras may be made 
in which the 3 'UTR sequence of GBV-B is replaced by the 
3 'UTR sequence of HCV. As the 3' terminal stem- loop 
structure is believed to be important for initiation of 
RNA replication and has been shown to be critical for 
infectivity of HCV in vivo , such chimeras may be used 
for more detailed analysis of the function of the 3' UTR 

35 



20 
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sequence of HCV in vivo and for the testing of candidate 
antiviral agents. 

In another embodiment, GBV-B/HCV chimeras may 
be constructed in which the structural or non- structural 
regions of GBV-B are replaced by corresponding regions 
of HCV. Such chimeras would be useful in identifying 
whether the inability of HCV to infect tamarins is due 
to the inability of HCV's structural region to bind the 
receptor necessary for infection of tamarins or to the 
absence of sequences in HCV' s nonstructural regions 
which are necessary for replication in tamarins. For 
example, the ability to infect tamarins with GBV-B/HCV 
chimeras in which the non- structural region of GBV-B is 
replaced by the non- structural region of HCV would 
indicate that the structural genes of GBV-B are 
necessary for viral infection in tamarins, and that the 
inability of HCV to infect tamarins is likely due to its 
lack of receptors for HCV. 

Alternatively, the ability to infect tamarins 
with GBV-B/HCV chimeras in which the structural region 
of GBV-B is replaced by the structural region of HCV 
would indicate that the non -structural genes of GBV-B 
are critical for viral infection in tamarins, and that 
the inability of HCV to infect tamarins is likely due to 
HCV's lack of nonstructural sequences which are 
necessary for replication in tamarins. 

Of course, GBV-B-HCV chimeras may be 
constructed in which only a portion of the non- 
structural or structural regions of GBV-B are replaced 
by the corresponding portions of HCV sequences. For 
example, a chimera in which only one or two of the three 
structural genes (C, El and E2) of GBV-B are replaced by 
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° the corresponding HCV structural genes may be made. In 
one embodiment, nucleic acid sequences comprising the El 
and E2 genes of GBV-B may be replaced by the sequences 
comprising the HCV El and E2 genes. In another 
2 embodiment, nucleic acid sequence comprising either the. 
El or E2 gene of GBV-B is replaced by sequence encoding 
either the HCV El or E2 gene. 

Alternatively, only a fragment of a GBV-B 
structural gene in the infectious GBV clone may be 

^® replaced with the corresponding HCV gene fragments. For 
example, the amino terminal of the GBV-B El gene may be 
replaced by the corresponding portion of an HCV El gene 
or an amino terminal portion of the GBV-B E2 gene may be 

15 replaced by an amino terminal portion of HCV E2 gene tht 
containing the HVRl region. As the structural genes of 
HCV are believed to be important for neutralization, 
chimeras containing an HCV structural gene(s) or 
fragment (s) thereof can be used to develop vaccines 

20 

agaxnst HCV. 

In yet another embodiment, chimeras in which 
individual non- structural genes of GBV-B, such as NS3 
RNA helicase, NS3 protease, or the NS5B RNA-dependent 
25 RNA polymerase are replaced by the corresponding non- 
structural genes of HCV may be constructed. Such 
chimeras would, for example, be useful in identifying 
inhibitors of viral enzyme activity which would be 
useful as antiviral agents. Of course, it is understood 

30 

that in order to construct chimeras in which the 
polyprotein cleavage sites of the GBV-B remain intact, 
it may be desirable to replace only a fragment of a 
nonstructural gene of GBV-B with the corresponding HCV 
35 gene fragment . 
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The present invention also relates to 
polypeptides encoded by the nucleic acid sequences of 
the invention or fragments thereof. In one embodiment, 
said polypeptide or polypeptides may be fully or 
partially purified from viruses produced by cells 
transfected with the nucleic acid sequences of the 
invention. In another embodiment, the polypeptide or 
polypeptides may be produced recombinantly from a 
fragment of the nucleic acid sequences of the invention. 
In yet another embodiment, the polypeptides may be 
chemically synthesized. 

The present invention further relates to the 
in vitro and in vivo production of GBV-B, mutated GBV-B 
or chimeric GBV-B/HCV viruses from the nucleic acid 
sequences of the invention. 

In one embodiment, the sequences of the 
invention can be inserted into an expression vector that 
functions in eukaryotic cells. Eukaryotic expression 
vectors suitable for producing high efficiency gene 
transfer in vivo are well known to those of ordinary 
skill in the art and include, but are not limited to, 
plasmids, vaccinia viruses, retroviruses, adenoviruses 
and adeno-associated viruses. 

In another embodiment, the sequences contained 
in the recombinant expression vector can be transcribed 
in vitro by methods known to those of ordinary skill in 
the art in order to produce RNA transcripts which encode 
the GBV-B of the invention. The GBV-B of the invention 
may then be produced by transfecting cells by methods 
known to those of ordinary skill in the art with either 
the in vitro transcription mixture containing the RNA 
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transcripts or with the recombinant expression vectors 
containing the nucleic acid sequences described herein. 

In assaying the ability of the mutated GBV-B 
sequences or of the chimeric sequences of the invention 
to infect taraarins, the virulence phenotype of the virus 
produced by transfection of tamarins with the sequences 
of the invention can be monitored by methods known in 
the art such as measurement of liver enzyme levels 
(alanine aminotransferase (ALT) or isocitrate 
dehydrogenase (ICD) ) or by histopathology of liver 
biopsies . 

The present invention also relates to the use 
of the infectious GBV-B sequence, the mutated GBV-B 
nucleic acid sequences or the chimeric sequences of the 
invention to identify cell lines capable of supporting 
the replication of GBV-B or the chimeras of the 
invention. 

Transfection of tissue culture cells with the 
nucleic acid sequences of the invention may be done by 
methods of transfection known in the art such as 
electroporation, precipitation with DEAE-Dextran or 
calcium phosphate, or incorporation into liposomes. 

In one such embodiment, the method comprises 
the growing of animal cells in vitro and transfecting 
the cells with the nucleic acid of the invention, then 
determining if the cells show indicia of GBV-B or HCV 
infection. Such indicia include the detection of viral 
antigens in the cell, for example, by immunofluorescence 
procedures well known in the art; the detection of viral 
polypeptides by Western blotting using antibodies 
specific therefor; and the detection of newly 
transcribed viral RNA within the cells via methods such 
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as RT-PCR. The presence of live, infectious virus 
particles following such tests may also be shown by 
injection of cell culture medium or cell lysates into 
healthy, susceptible animals, with subsequent exhibition 
5 of the signs and symptoms of GBV-B infection. 

Suitable cells or cell lines for culturing 
GBV-B or the chimeric GBV-B-HCV include, but are not 
limited to, lymphocyte and hepatocyte cell lines known 
in the art . 

^® Alternatively, primary hepatocytes can be 

cultured, and then infected; or, the hepatocyte cultures 
could be derived from the livers of infected tamarins. 
In addition, various immortalization methods known to 

15 those of ordinary skill in the art can be used to obtain 
cell-lines derived from hepatocyte cultures. For 
example, primary hepatocyte cultures may be fused to a 
variety of cells to maintain stability. 

The invention also provides that the nucleic 

20 

acid sequences and viruses of the invention be supplied 
in the form of a kit, alone or in the form of a 
pharmaceutical composition . 

All scientific publication and/or patents 
25 cited herein are specifically incorporated by reference. 
The following examples illustrate various aspects of the 
invention but are in no way intended to limit the scope 
thereof . 

EXAMPLES 

30 

Materials and Methods 

Source of GB virus B 

Two tamarin pools VR-806, {American Type 
35 Culture Collection) and H205, were used for experimental 
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o 

transmission of the GB virus agents to tamarins species 
Saguinus my s tax and Saguinus oedipus. 

Amplification, cloning and sequence analysis of GBV-B 

Viral RNA was extracted from aliquots of the 
^ GB 2/94 serum pool or CT 11/91 liver homogenate with the 
TRIzol system (GIBCO/BRL) . Primers used in cDNA 
synthesis and PGR amplification were based on the 
genomic sequence of GBV-B published by Simons et al 
10 (Simons 1995) shown in SEQ ID N0:3. Long RT-PCR was 

performed using Superscript II reverse transcriptase 
(GIBGO/BRL) and the Advantage cDNA polymerase mix 
(Clontech) as described previously (Tellier 1996) . Four 
subgenomic regions of GBV-B covering the entire 
published sequence (Simons 1995) were amplified from 
serum and the PGR products were purified and cloned into 
pGEM-9Zf {-) (Promega) or pCR2 . 1 vector (Invitrogen) 
using standard procedures. 
20 The 5' terminus of GBV-B was amplified from 

serum by using the rapid amplification of cDNA ends 
(RACE) with dC or dA tailing (GIBCO/BRL) and GBV-B 
specific antisense primers. Two different approaches 
were used to determine the 3 ' terminal sequence of GBV- 
B. In one approach, GBV-B RNA extracted from serum was 
circularized with T4 RNA ligase (Promega) and the S'-to- 
3 * -end-ligated viral RNA was amplified in RT-PCR using 
specific GBV-B primers. In the second approach, the 5' 
30 end of the negative strand GBV-B RNA extracted from the 
liver homogenate was amplified using the 5' RACE with dC 
tailing and GBV-B specific sense primers. The PGR 
products were cloned directly into pCR2.1-TOPO by using 
the TOPO TA Cloning Kit (Invitrogen) . 
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The consensus sequence of GBV-B was determined 
by direct sequencing of PGR products (nucleotides 1-9078 
and nucleotides 9130-9359) and by sequence analysis of 
the clones (nucleotides 1-7135 and nucleotides 7151- 
93 99) . Nucleotide positions correspond to those of the 
infectious clone (pGBB) . Analyses of genomic sequences 
were performed with GeneWorks (Oxford Molecular Group) 
(Bukh 1995) . To determine whether the GenBank data base 
contained sequences with homology to the GBV-B 3 • UTR 
sequence identified in the present invention, a "Blast" 
search was performed. The predicted secondary structure 
of the GBV-B and HCV 3' UTR sequences were determined by 
the program "mfold" (Genetics Computer Group) . 

Construction of consensus cDNA clones of GBV-B 

First, clone pGBB5-l, a consensus clone of 
GBV-B 2/94 containing the 3' terminus of GBV-B as 
published by Simons et al was constructed (Simons 
1995a)- The core sequence of the T7 promoter, a 5' 
guanosine residue and the sequence of GBV-B (913 9 
nucleotides) were cloned into pGEM-9Zf {-) vector using 
Notl/Sacl sites. A BamHl site was included at the GBV-B 
3' terminus. Digested fragments containing the 
consensus sequence were purified from subclones and 
ligated using convenient sites. Next, a second 
consensus clone of GBV-B, clone pGBB, was constructed by 
inserting the additional 3* terminal sequence, amplified 
by PGR from one of the clones obtained by the RACE 
procedure described above, into pGBB5-l using Xmal (at 
position 9114) and BamHl sites. A Xhol site was 
inserted following the GBV-B 3* terminus. DH5-alpha 
competent cells (GIBCO BRL) were transformed and 
selected on LB agar plates containing 100 ng/ml 
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ampicillin (SIGMA) and amplified in LB liquid cultures 
at 30°C for 18-20 hrs (Yanagi 1997) . Each cDNA clone 
was re -transformed to select a single clone, and 
large-scale preparation of plasmid DNA was performed 
5 with a QIAGEN plasmid Maxi kit as described previously 
(Yanagi 1997) . Each clone was genetically stable since 
the digestion pattern was as expected following 
retransf ormation and the complete sequence was the 
expected one. 

10 

Intrahepatic transfection of tamarins with transcribed 
GBV-B RNA 

In 100 fil reactions, RNA was transcribed in 
vitro with T7 RNA polymerase (Promega) from 10 jig of 

IS linearized template plasmid. The plasmid pGBB5-l was 

linearized with BamHl (Promega) and the plasmid pGBB was 
linearized with Xhol (Promega) . The integrity of the 
RNA was checked by electrophoresis through agarose gel 

2Q stained with ethidium bromide. Each transcription 
mixture was diluted with 400 fil of ice-cold 
phosphate -buffered saline without calcium or magnesium 
(SIGMA) and then immediately frozen on dry ice and 
stored at -80^C. Within 24 hours of synthesis, two 

25 

transcription mixtures were injected into each tamarin 
by percutaneous intrahepatic injection guided by 
ultrasound (Emerson, 1992; Yanagi 1998, 1999) . If the 
tamarin did not become infected, the same transfection 
30 was repeated once. All transfected animals were 
negative for GBV-Asm as determined by the protocol 
described previously (Bukh 1997a) . 

Monitoring of experimental course in tamarins 

35 
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Serum samples were collected weekly from the 
tamarins and monitored for liver enzyme levels [alanine 
aminotransferase (ALT) , garama-glutamyltranspeptidase 
(GGT) , and isocitrate dehydrogenase (ICD)] by standard 
methods and for GBV-B RNA by a specific reverse 
transcriptase -polymerase chain reaction (RT-PCR) assay. 
Total RNA was extracted from 100 |il of serum using the 
TRIzol reagent. The RNA pellet was resuspended in 10 mM 
dithiothreitol (DTT) containing 5% (vol/vol) of RNasin 
(20-40 u/pl) (Promega) . The RT-nested PGR was performed 
with primers from the 5' UTR of GBV-B (external primer 
pair: 5'-CCT AGO AGG GCG TGG GGG ATT TCC-3' and 5 ' -AGG 
TCT GCG TCC TTG GTA GTG ACC-3'; internal primer pair: 
15 5»-GGA TTT CCC CTG CCC GTC TG-3' and 5 • -CCC CGG TCT TCC 

CTA CAG TG-3*). The reverse transcription was performed 
with avian myeloblastosis virus reverse transcriptase 
(Promega) and the external ant i- sense primer and nested 
PGR was performed with AmpliTaq DNA polymerase or 
AmpliTaq Gold DNA polymerase (Perkin Elmer) as described 
previously (Bukh 1998a) . Specificity was confirmed by 
sequence analysis of selected DNA products. Each set of 
experiments included a positive control sample (a 10*® 
dilution of GB 8/93, estimated titer 100 genome 
equivalent (GE) ) and appropriate negative control 
samples. The genome equivalent (GE) titer of GBV-B in 
positive samples was determined by RT-nested PGR on 
10-fold serial dilutions of the extracted RNA (Bukh 
1998a) . One GE was defined as the number of GBV-B 
genomes present in the highest dilution positive in RT- 
nested PGR. The sensitivity of this RT-nested PGR assay 
for GBV-B was equivalent to that of our RT-nested PCR 
assay for HCV (Bukh 1998b), for example, conserved NS3 
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primers which had the same sensitivity for GBV-B as the 
5' UTR primers could detect HCV at optimal sensitivity 
in samples with known HCV genome titer. Testing for 
GBV-A and GBV-A variants was performed by RT-nested PGR 
assays as described previously (Bukh 1997a) . 

The consensus sequence of the complete ORF was 
determined by direct sequencing of overlapping PGR 
products obtained by long RT-nested PGR on serum from 
one of the tamarins infected with RNA transcripts as 
previously described (Yanagi 1997) . 

Example 1 

Transmission of GB Agent in Tamarins 

To generate virus pools of the GB agent, 
tamarins were inoculated intravenously with pooled sera 
of the eleventh tamarin passage of this agent (Fig. 1) . 
Acute phase sera from a 5. mystax tamarin which 
developed hepatitis were pooled (GB 8/93) and inoculated 
into additional S. mystax tamarins to generate a second 
pool of acute phase serum (GB 2/94) . Both serum pools 
contained approximately 10® GE/ml of GBV-B and GBV-A. A 
10% liver homogenate (CT 11/91) was prepared from a S. 
oedipus tamarin which developed hepatitis following 
inoculation with the twelfth passage of the GB agent. 
The titer of GBV-B in the liver homogenate was 
approximately lo'' GE/ml. The GB 2/94 serum and CT 11/91 
liver samples were used as GBV-B cloning sources in the 
present study. 

Inoculation of eight S. wystax tamarins with 
ten-fold serial dilutions of the GB 2/94 pool 
demonstrated that its infectivity titer of GBV-B was 10® 
tamarin 50% infectious doses (TIDso) (Fig. 2) . The five 
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GBV-B infected tamarins all developed acute resolving 
hepatitis characterized by early appearance of viremia 
(weeks 1 or 2 p.i.), peak viral titers of 10''-10® GE/ml 
and clearance of viremia after 9-16 weeks (Fig. 2) . Two 
5 of these tamarins {S. mystax 769 and 777) were infected 
only with GBV-B and were negative for GBV-A and GBV-Asm# 
whereas the other three tamarins were infected with both 
GBV-B and GBV-Asm. A 5. laystax tamarin inoculated with 
the liver homogenate also developed acute resolving 
hepatitis with peak GBV-B titers of 10^ GE/ml and 
clearance of viremia after 11 weeks. Likewise, four S. 
wystax tamarins inoculated with dilutions of the GB 8/93 
pool developed acute resolving hepatitis with clearance 
of the GBV-B virus after 11-26 weeks. Thus, GBV-B 
infection in 5. mystax tamarins is characterized by 
acute hepatitis, early appearance of viremia, high peak 
viral titers and viral clearance. 
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Example 2 
Novel 3* Terminal Sequence of GBV-B 



The consensus sequence of the complete 5 ' UTR 
of GBV-B (nucleotides 1-445) was deduced from 13 clones 
containing nucleotides 1-283 and 3 clones containing 
nucleotides 31-445. In addition, the entire 5' UTR 
sequence was determined by direct sequencing of the 
amplicons. The sequences of the various clones were 
highly conserved and the consensus 5 ' UTR sequence of 
30 GBV-B from this pool was identical to that of the 

previously published sequence for GBV-B (Simons 1995a) . 
It is noteworthy that 13 of 15 clones analyzed from the 
rapid amplification of cDNA ends (RACE) procedure 
contained the published GBV-B 5* terminus (A residue) 

35 
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and that the same 5 * terminus was obtained whether the 
5' RACE was performed with dC or dA tailing. 

The consensus sequence of the ORF (nucleotides 
446-9037) was determined by direct sequencing of PGR 
products obtained using long RT-PCR (Yanagi 1997) . In 
addition, 3 clones containing nts. 446-7135 (one of 
these clones had a deletion of nts. 3036-3636), 2 clones 
containing nts. 2019-3373, 5 clones containing nts. 
7151-8261 and 7 clones containing nts. 7521-9037 were 
analyzed. The sequences of GBV-B clones in this pool 
were very homogeneous. Evidence of micro-heterogeneity 
was found at only 70 (0.8%) nucleotide and 36 (1.3%) 
amino acid positions, scattered throughout the ORF. The 
proportion of amino acid positions with heterogeneity 
ranged from 0.5-3.2% in different putative gene regions 
(lowest in NS3 and NS5B; highest in E2 and NS2) . The 
GBV-B ORF sequence differed from the published sequence 
of GBV-B (Simons 1995) at 34 (0.4%) nucleotide and 12 
(0.4%) deduced amino acid positions, respectively (Table 
1) . 
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Table 1 



Nucleotide and amino 


acid differences 


among 


GBV-B 


(Simons 


1995a) the 


consensus sequence 


of GBV-B 


recovered from a virus 


pool used as 


the cloning 


source 


<GBV-B, 2/94) and the 


infectious 




clone of GBV-B (pGBB) . 






Genomic Region* 


Position 
nt [aa] 




Nucleotide 






Amino Acid 








GBV-B 






GBV-B 






GBV-B 


2/94 


pGBB 


GBV-B 


2/94 PGBB 


5' UTR (1-445) 














C (446-913) 














El (914-1469) 


1030 


c 


T 


T 






E2 (1490-2641) 


1498 


T 


C (t) 


C 








162B 1395] 


G 


A (g) 


A 


V 






2552 [7031 


G 


A (g) 


A 


D 






2562.2563 


C.A 


A,C 


A.C 


P 


H H 




(706) 














2566 


T 


T 


T 








2625 (7271 


C 


T 


T 


A 


V V 


N52 (2642-3385) 


2647 


C 


T (C) 


T 








2816 [791] 


c 


T 


T 


L 


F F 




2855 [804] 


A 


G 


G 


T 


A A 




3235 


A 


G 


G 






N53 (3386-5125) 


3475** 


C 


C (t) 


T • 








3760 


c 


T (c) 


T 








4114 


c 


T 


T 








4117 


c 


A 


A 








4177 


T 


c 


C 








4615 


c 


T 


T 






NS4A (5126-5290) 














NS4B (5291-6034) 


5329 


c 


T 


T 








5332 


T 


C 


C 








5350 


A 


C 


c 








5455 


c 


T (c) 


T 






NS5A (6035-7267) 


6413 
(1990] 


T 


A (t) 


A 


L 


N (L) M 




6577 


G 


T 


T 








6690 


T 




Q 


I 


T(I) T 




[2082] 














6965 


T 


C (t) 


c 


S 


P (S) P 




[2174] 














7015 


A 


G (a) 


G 








7128 


G 


A 


A 


G 


E E 




(2226) 














7138«* 


A 


A 


G 








7142 


A 


G 


G 


T 


A A 




12233) 












NS5B (7268-9037) 


7282 


T 


C (t) 


C 








7849 


C 


A 


A 








7852 


C 


T 


T 








8942 


G 


A (9) 


A 


V 


I (V) I 




(2981) 














8971 


T 


C 


C 








9026 


C 


T (c) 


T 






3' UTR (9038- 


9067 


T 


C 


C 






9399) 
















Poly(U) 


27 nts 


11-23 nts 


23 nts 








9134 


Deletion 


c 


C 








9141-9399 




259 nts 


259 nts 







"Nucleotide positions corresponding to pGBB. Putative gene borders defined as 
suggested by homology with HCV <Muerho££ 1995) . No homology was observed at the 
NS2-NS3 junction. 

••Positions that differ between the cloning source (GBV-B 2/94) and the 



infectious clone of GBV-B (pGBB) . The change introduced into pGBB at position 
713 8 introduced an artificial Sail site, nd: Not determined. Nucleotides and 
amino acids shown in parenthesis were found as a minor species in the cloning 
source (GBV-B. 2/94 



0075337A1_I_> 



wo 00/75337 



PCTAJSOO/15293 



- 25 - 

The sequence for the 3 * UTR is shown in Figure 
3. Additional 3* UTR sequence was initially identified 
by performing RT-PCR across 5 * -to-3 • -end-ligated viral 
RNA extracted from serum. In all 4 clones with GBV~B 
sequences, the 5* UTR was truncated compared to the 
published sequence (simon 1995a) . However, whereas one 
clone (29c) had the exact 3* terminus previously 
published by Simons et al . (Simons 1995a), the three 
other clones {29a, 29b, 29d) had 150 additional terminal 
nucleotides. Compared with the published sequence, all 
four clones had a single nucleotide insertion (C 
residue) at position 9134. Next, RACE using dC-tailing 
only was performed on the 5' end of the negative -strand 
RNA extracted from the liver homogenate. All 11 clones 
analyzed had additional sequences at the 3* terminus. 
Compared with the published GBV-B sequence, two clones 
(gb6, gb23) had 259 additional nucleotides, 8 clones 
(gb9, gbl9, gb20, gb21, gb24, gb25, gb30, gb35) had 236 
additional nucleotides and 1 clone (gbS) had 232 
additional nucleotides. Moreover, all of these clones 
had the insertion at position 9134. The 3' UTR 
sequences among the various clones were highly conserved 
(Fig. 3) . To demonstrate that the terminal 22 
nucleotides found only in clones gb6 and gb23 existed in 
circulating viruses, RT-nested PCR was performed on 10- 
fold serially diluted RNA extracted from the serum pool 
GB 2/94 using an RT and external antisense primer 
deduced from this sequence. GBV-B RNA was detected at a 
dilution of 10"' and the sequence of the amplicon was 
identical to the sequence recovered from the liver 
homogenate. Thus, the 3' UTR of GBV-B consists of a 
short sequence of 30 nucleotides followed by a 11-24 
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nucleotide- long poly (U) tract (single C residues were 
observed in GBV-B from the liver homogenate) and a 3' 
terminal sequence of at least 309 nucleotides. The new 
GBV-B 3* UTR sequence did not have significant homology 
to any of the sequences deposited in the GenBank 
database. A prediction of the secondary structure of 
the 3' UTR sequence is shown in Figure 4. The most 
notable feature of the secondary structure is a highly 
stable stem-loop structure at the very 3' end consisting 
of 47 nucleotides. 

Example 3 

The pGBB Clone of GBV-B is Infectious in vivo 

The infectivity of RNA transcripts from the 
consensus clone pGBB5-l which encompassed only the 
published GBV-B sequence (Simons 1995) was first tested. 
Within the GBV-B sequence there were no deduced amino 
acid differences and only 2 nucleotide differences (at 
nucleotide positions 3475 and 7138) between the 
consensus sequence of the cloning source (GBV-B 2/94) 
and the sequence of pGBB5-l clone. In addition, the 3* 
UTR of pGBB5-l had a deletion at nucleotide position 
25 9134 and was missing the 3' terminal 259 nucleotides 

(Fig. 3) . Prior to transcription, the pGBB5-l clone was 
linearized at the BamHl site with digestion at the exact 
GBV-B 3* terminus. The RNA transcripts from pGGB5-l 
were injected into the liver of two tamarins (S. mystax 
797 and 815) . GBV-B RNA was not detected in weekly 
serum samples collected during 17 weeks of follow-up. 
As the susceptibility of these two tamarins to GBV-B was 
subsequently demonstrated by experimental infection 
35 using a GBV-B virus pool, the consensus clone pGBB5-l 
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which lacks the 3 ' terminal sequence of GBV-B is thus 
not infectious in vivo. 

Next, the infectivity of RNA transcripts from 
the full-length consensus GBV-B cDNA clone pGBB was 
tested. The pGBB clone was identical to the pGBB5-l 
clone except in the 3' UTR. Thus, in addition to a 5 * 
UTR of 445 nucleotides, an ORF of 8592 nucleotides 
encoding 2864 amino acids and a 3' UTR of 103 
nucleotides, the pGBB clone also contains an additional 
259 nucleotides in its 3' UTR. pGBB was linearized at 
the Xhol site which added an additional C residue at the 
3* end of the transcribed GBV-B RNA. When RNA 
transcripts from the pGBB clone were injected into the 
15 liver of two tamarins (S. mystax 816 and 817) , both 

tamarins became infected with GBV-B with viremia at week 
1 p.i. and peak viral titers of 10® GE/ml (Fig. 5). The 
consensus sequence of PGR products of the complete ORF, 
amplified from serum obtained during week 2 p.i. from 
one tamarin (5. mystax 817) , was identical to the 
sequence of pGBB, including at the two positions which 
differed from the consensus sequence of the cloning 
source and from the published sequence of GBV-B (Table 
25 1) . By performing RT-PCR as desired above, it was 

demonstrated that the very 3 • terminal GBV-B sequence of 
pGBB existed in the circulating viruses in this tamarin. 
Within two weeks of the transfection both tamarins 
developed hepatitis with dramatically elevated liver 
enzyme levels (Fig. 5) . Thus, the pGBB clone is 
infectious in vivo whereas the clone pGBB5-l which lacks 
the last 259 nucleotides was not. 
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WHAT IS CLAIMED IS; 

1. An isolated nucleic acid molecule which 
encodes GB virus-B, said molecule capable of expressing 
said virus when transfected into cells. 

2. The nucleic acid molecule of claim 1, 
wherein said molecule encodes the amino acid sequence of 
SEQ ID NO: 2. 

3. The nucleic acid molecule of claim 2, 
wherein said molecule comprises the nucleic acid 
sequence of SEQ ID NO:l. 

4. A DNA construct comprising a nucleic acid 
molecule according to claim 1. 

5. A DNA construct comprising a nucleic acid 
molecule according to claim 3 . 

6 . An RNA transcript of the DNA construct of 
claims 4 or 5 . 

7. A cell transfected with the DNA construct 
of claims 4 or 5. 

8 . A cell transfected with RNA transcripts 
of claim 6 . 

9. A GB virus-B polypeptide produced by the 
cell of claim 7. 

10. A GB virus-B polypeptide produced by the 
cell of claim 8. 

11. A GB virus-B produced by the cell of 

claim 7. 

12. A GB virus-B produced by the cell of 

claim 8. 
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13. A GB virus-B whose genome comprises a 
nucleic acid molecule according to claim 1 . 

14. A GB virus-B whose genome comprises a 
nucleic acid molecule according to claim 3 . 

15. A method for producing a GB virus-B 
comprising transfecting a host cell with the DNA 
construct of claims 4 or 5. 

16. A method for producing a GB virus-B 
comprising transfecting a host cell with the RNA 
transcript of claim 6 . 

17. A composition comprising a nucleic acid 
molecule of claim 1 suspended in a suitable amount of a 

15 pharmaceutical ly acceptable diluent or excipient. 

18. A composition comprising a nucleic acid 
molecule of claim 3 suspended in a suitable amount of a 
pharmaceutically acceptable diluent or excipient. 
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19. A nucleic acid molecule comprising a 
chimeric virus genome, said genome being a GB virus-B 
genome according to claim 1 in which a 3' or 5' UTR 
sequence of the genome is replaced by a corresponding 
region of the 3' or 5' UTR sequence of a hepatitis C 
virus genome . 

20. The nucleic acid molecule of claim 19, 
wherein a 3' UTR sequence of the genome of a GB virus-B 
is replaced by a corresponding 3' UTR sequence of a 
hepatitis C virus genome. 

21. The nucleic acid molecule of claim 20, 
wherein the 3' UTR sequence is the 3' UTR terminal stem 
loop sequence . 
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° 22. The nucleic acid molecule of claim 19, 

wherein a 5' UTR sequence of the genome of a GB virus-B 
has been replaced by a corresponding 5' UTR sequence of 
a hepatitis C virus genome. 

5 23. The nucleic acid molecule of claim 22/ 

wherein the 5' UTR sequence is the IRES sequence. 

24 . A nucleic acid molecule comprising a 
chimeric virus genome, said genome being a GB virus-B 
10 genome according to claim 1 in which the non- structural 
region of the genome of a GB virus-B has been replaced 
by the non- structural region of a hepatitis C virus 
genome . 

J J 25. The nucleic acid molecule of claim 24, 

wherein at least one gene from the non- structural region 
of the genome of a GB virus -B has been replaced by the 
corresponding gene from the non- structural region of a 
hepatitis C virus genome. 

20 

26. The nucleic acid molecule of claim 25, 
wherein the gene from the non- structural region is 
selected from the group consisting of NS3 protease, NS3 
RNA helicase, or NS5B RNA polymerase. 

25 

27. A nucleic acid molecule comprising a 
chimeric virus genome, said genome being a GB virus-B 
genome according to claim 1 in which the structural 
region of the genome of a GB virus-B has been replaced 

30 by the structural region of a hepatitis C virus genome. 

28. The nucleic acid molecule of claim 27, 
wherein at least one gene from the structural region of 
the genome of a GB virus-B has been replaced by the 

35 
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corresponding gene from the structural region of a 
hepatitis C virus genome, 

29. The nucleic acid molecule of claim 28, 
wherein the gene from the structural region is selected 

5 from the group consisting of El, E2 or C. 

30. The nucleic acid molecule of claim 2B, 
wherein the El and E2 genes from the structural region 
of the genome of a GB virus-B have been replaced by the 

10 El and E2 genes of a hepatitis C virus genome. 

31. The nucleic acid molecule of claim 28, 
wherein the El gene from the structural region of the 
genome of a GB virus -B has been replaced by the El gene 
of a hepatitis C virus genome. 

32. The nucleic acid molecule of claim 28, 
wherein the E2 gene from the structural regions of the 
genome of a GB virus-B has been replaced by the E2 gene 
of a hepatitis C virus genome. 

33. A DNA construct comprising the nucleic 
acid molecule of claims 19, 24 or 27. 

34. An RNA transcript of the DNA construct of 

2^ claim 33 . 

35. A virus whose genome comprises a nucleic 
acid molecule according to claims 19, 24 or 27. 

36. A nucleic acid molecule comprising a 

30 chimeric virus genome, said genome being a hepatitis C 
virus genome in which a 3' or 5' UTR sequence of the 
genome is replaced by a corresponding region of the 3' 
or 5' UTR sequence of a GB virus-B genome according to 
claim 1. 

35 
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37. A nucleic acid molecule comprising a 
chimeric virus genome, said genome being a hepatitis C 
virus genome in which the non- structural region of the 
genome has been replaced by the non- structural region of 
a GB virus-B genome according to claim 1. 

38. A nucleic acid molecule comprising a 
chimeric virus genome, said genome being a hepatitis C 
virus genome in which the structural region of the 
genome has been replaced by the structural region of a 
GB virus-B genome according to claim 1. 

39. A polypeptide encoded by the nucleic acid 
molecule of claims 19, 24 or 27. 

40. A polypeptide encoded by the nucleic acid 
molecule of claims 36, 37 or 38. 
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GOCflGcxra: lUflSiOGGoac Gftcaciaac GAroftftrcac T oo ajiu xc iA so 

GGAACEACIG ' ILTICA OGCR. GAAftGOGTCT AQ0CA3G333 ITOGIMGfiG 100 

' i G'ica iG caG uc' x oc fl Gaac oxajuiooc oeaGftGaoocA Ta mojiuiu iso 

OQSAAOGdGn:' GAGI3O0CG GAATIOGCAG GA0GAC0333 'iULTi'lLTlU 200 

GAZAAAOGGS CICAATOOCT GS^SOTIGS ULUIGGGGGC GCAAGACZGC 250 

TaOODGflGm GlUi'iUmiU OOGAAAG9X TiUllULACr GCXnGKQ^ 300 

GI U L TIU UL A gIGGCD33GS AGG>TCI03m GAOliiGCAC CATCAQCRCI3 350 

AMDCITAftC CICAAflGAAA AAOa^AMST AAOWDCAftCC GIOOOOSO. 400 

QGAOGTCRftG TlUUULOaiG GOjaiC ft GaT (JLji ' lUL^jlUGA . Gi ' l ' mji'lUl' 450 

iGOoaoscaG ocmnaGA Tiumiui GC GogaaoGflG GRflGR LTiUL ' 500 

GAGGQ3I03C AAOJiOSAGS TAGAaSIC3G (JL'UftlU3XA. ASSGAOSrOQ 550 

QOCXX3A03QC AGSAOnOSS CICMXXXDS G?I3^L)ULTiUlj CJOGCiUiyOG 600 

GCAftTOAoaG 'i'iuaax,'iG3 amsftior Tocroicroc cjuljiggcjict sso 

OQQOCERQCT QQQQOOOCAC AGftCXXXXJGS OSIRGGnm: QGAATnOOS 700 

TJ^AOGflCAIC GM3m:TlA CUiUJUULTr CJGOGGAOCIC MX3333raCA 750 

TaoDGCTOGr oaGorGCCT cTiutj ftQ3CX3 ciocxaGax ociucn rar 800 

QQC3GICCQ33 TICIGSAfiGA 0QQ03IGAAC TKri3CM>CPC GCSAAULTiUJ 850 

IGGTIGCTCT T lL'lUimL'i' TOCTICIGGC: OCTOCICTCT TOXTGACIG 900 

1Q0Q0QCTIC A3cxmx3A GiQoacAflrr ocroG333CT TmxaiGic 950 

AOCAAIGATT GULUiAftCIC GAUim'iUXG ITOjAGQGQS GQGAaODCAT 1000 

cciGcacAcr ujuGc m ui G ' lUJL ' i ' iuuur TOOOofiGaGr aaogocioga loso 

GGflGnOOGT GQOQGIGACC 0QCA03GfIQG OCAOCfiQQoA. (DQOCSWOC 1100 

CQCACAAOQC ASCTICGaOG TCATATOGAT CiUL'i'iUlUG QGM333XM: 1150 

anciQCTOG QOOCiciaQS TOGSOGfiGcr Gam933icr GrcmcnG 1200 

TI03ICAACT (ji ' i ' i!ftULTiL ' TCTOCCRGQC GOCACIQSAC Gfi q3Cftft3ft C 1250 

TQCAAli'iWlT CEMCEKnX 0000302^^ A3X3GICMC GiM03CMG 1300 

OSAaSOGPiTC MGAACIOGT ODOCI^WDQX A333nG3IG GEftGOOGC 1350 

IGCIOCO G AT aXSOVfiGOC AICRIGGfiCA IGftlOXTOS TOCICRCTOS 1400 

QGftGIOCIGS OQC33CAaaGC Uim'i ' lUiUL ' AIG3IG333A ACI0333GAA 1450 

(jU i CC l Ggi A G ' lU.'iUL ' iUL: i mTlUUO QS CU i UGA OaCG GAAftOOCaOG 1500 

TCftOOGGGSa AAKTGOOGGC aSCaCJCSmS CiUUJJi ' iUr TSGflCroCrr 1550 

ACAOCAG333 OCaAOC^GAA. CAIOCAACIG AICAflCAOCA AUUULWJi'iU 1600 

GCACATCAAT AQCAD3QCCr TGAATIQCAA TGAAALiJL'iT AACfia333Cr 1650 

UJriA QCaG3_ GJiUi ' iL ' i:A T CAACRCAAAT TGft^'iLTiU A3QCIGICCT 1700 

GAGAQGTIGG OJftX'iOJaS AUUUL TIIA OC GRTrnQOCr AQQQCIQQQG 1750 

lU-'imCAGT TS^TOXAAOS GAAGQQGGCT GGPDGAAD3C COJi^CiGCT 1800 

GQCACrAODC TOCAAGADCTT TSIOaCATIG TOGCOXJAA GAULUlUiGr 1850 

OQ00CO3IAT ATiacnCAC TCOCAQQOCX: GTGGTOSIGG GAA0GAC3CGA 1900 
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CAGGi03GGC ULULL'iAOCT iOGCIGQQS lOCAAASIGAT ACXSGAIUILT 1950 

HOSSXXJTlMi. CAACACCAGS CXaOJUL'lGS OC3^ATIGC3IT COSmSIPCC 2000 

TOGKIGAftCr GAACIQGftTT CfiOCS^AAQIG TQOQSmSC OULTiUiUi ' 2050 

CA3a3GAQ3G GIG93ZAACA. ACAOCTIQCT CiaOXCACT GKi'iULTiUJ 2100 

GCAAAcaia: osAfiGoczo^ Tfiacroosr ojuxT O G G a TOOdOGftar 2150 

ACAOGCT^OSr OL m Oa i OG A . CiaCGOSraT i^GSCnriGX A H ' IMULTIU 2200 

TPCClfrCMir IMJ^XKHKT HCMfGICPG GAIGTAOSIG G3^3GiajlUj 2250 

AGCACftQQCr GSftflGOGGOC IQCAACroSA. 03CG3GGa3A. AOGCIGi Uft T 2300 

CIOSAAGftCA. GQGRCaGGfIC OGAQCICfiQC (JLUi'iUL'iUL' TOICXaOCftC 2350 

ACAGIGQCftG GlU L T lO m ' GTlUmCAC GfiOX'IGOCa. tJULTlUiULA 2400 

CXDGGGCICaT OCAOdOCAC CAGAACATIG TG303IGCA, G'i!ALTlUrAC 2450 

OgOGTC^GGGT CAAGCAIOQC mOl 'l U G mi ' MOAAGIGSS AG EflOjllUr 2500 

' lUiUC ' lUITC Ci ' lL ' iUL'l ' lU C2«303GQ33 OGTCIGCTCr ' iUL ' i ' iUIUA 2550 

TGft lUi ' -LA CT GAIMOCCAA 0333^03333 CVL'VJJfiC/>A GCiLGlAMA 2600 

CTCAATOZftG CftliaXIGGC CmaOSCAC QSICrroiGT (JL ' i'iUL ' iUUi ' 2650 

GTicnciGC Ti ' iumiuar AiciGAfloaG aaoGaraQSiG ccmaoogs 2700 

TCIACXXXXT CDVOaOGKIG 'IG30aCICC ICC'ICJCICC T C Cllja G bTlU 2750 

GCICAQCX3G3 CAmX3CACr QGACTmSfiG (JiLmiGOiT GG/iLJlGUULG 2800 

aji ' iUl ' lL ' i'i' GTOOOCSmA 'iGGCUC'iGAC TCIGTOQOCA TATI30\ftQC 2850 

GCEAIftTCMS CIGSiaZKIG IQGIQOCTIC AL/mTi'lUi' GAa3^3RG?IA 2900 

GAAGQQO^AC TGCR LUmm aji'iUJUUa: CICAft miUC GOaGGGGaOS 2950 

CGAaOOOGIC AICTEACICft. TGICflGEftGfr ACftOOOGAOC CiUGlMTlU 3000 

ACAICAfXAA ACI^L'iUl'iG GOCATCTiaS GftOCXXTTIG GKi'iUi'lUAA. 3050 

GQCAGnTGC TD^AftGIOCr CDOTDGIG OXCTTCAAG CJULTiUlUULJ 3100 

GMCn30303 CiaGGGGGGA. AGMaGOGGG AQGICRTiaC GIt3C?VAKrQG 3150 

GC?mZA:iCAA. GITZ^GGGGOG CTE^CIGGCA OJIMUiUlA OAACCAICIC 3200 

AODOCICnC GAGACroOQC QCACAAa33C CIOOGAGAIC 'iUJULUlULC 3250 

IGIGGAftOZA. UIUJILTILT ODOS^KIGSA. GADC3VAQCIC MCRCCfIG33 3300 

QOGCAGftlRC OQaaQOGIQC GGIGftCATCA. ICAALUJL'iT QOaCGICICr 3350 

Gajum W 333 GOCAGaGKr ACIQCTIQQG GCRGGCX3fia3 GAflaOGICIC 3400 

CftftOGGGIGG AGGTIGCIGG GQCGCKrCAC CLULjilAOGOC CAGCAGAOS^ 3450 

GftQQCClCCT W333IC?I3fflA. ATCADCAQOC TCACIGGCOG GGJCAAAAflC 3500 

CAAGIQGA03 GIGAQSTOCA GRTOGIGICA ACIQCTfiOOC AAAOCTIOCT 3550 

GGCAAOncr jmiAAIOaQG TA3^^ 'i'JiSiftOCftC GQGQOQGSAA 3600 

CXaOGftOCAT OGG K lCftOOC AAQQGIOCIG TCATOCaGRT GISflaGCAAT 3650 

GIQGftC3CAAG A0CTIGIGQ3 CIGGOOOXT OClCAflQGIT COOaCICRTr 3700 

CaCACOClGr ACCIGLUJCT CCICGGftDCT TlftOCTQGIC PnS>£X3CM33 3750 

(DOaviGICAT TCOOGIGOQC OQQOGftGGIG AIM3CRQQ3G TSLLUiULTi' 3800 
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^iQ:ujja^- ocarriDcnR cnGftAflG9c T oi ' iuuu y LjLj giuuul - iuit Siio 
ctgjqxojl; osorax33 -KmriMT amaooacE Groiocsm: 3900 

GIQ3RGIQC3C TAAAGCX23IG Gft UlTimUJ CIGIOSfiGAA OCDmaCA 3950 
MXX TGMSKT OaOOGSIGIT CaCDSftCAAC ICCI C i U CftC CfiC33GIG0C 4000 

ccRGAQcnc GBoauLUx: AXTOcm: Taxaooggc AoooGfiaAcsi 4050 

GcaocAaQsr ooGaanro laoacftGcxr AcmzmoA G aiurmjii? 4100 

cieaACorr aunujiuj AaoarTOr Tnu j i uj iT Acaroio^A 4150 

^raaoas GTiGftrocm Maicaosftc O G G GG n a m acwotocea 4200 

<Jiv->:<. rt 33 x caaoosiac Tocaociaos GGaagncCT TOooGaaagG- 4250 

G3snajii;flG GAGsiGCTm TxspcmKEk ATircmaaos ^aGTOULftL'iu 4300 

cAossmrc acaicooct TOGGC A ioaG cftcrorocrr oraAocaG 4350 

A^CTQ3Q3G QSOGa GACIG GITGIOCTCG OCACIQCmC OGC' i UUUCJJC 4400 

1\,UiiuACIG TGIOOCai CX: TAACATO3AG GflQGTIQCIC TCICCftOCAC 4450 

OTGAGAGMC CLLT l'l'iao S QGAAQSrEAT OOOOCTOGAG GIGATCAAQ3 4500 

Q^SSAAGACft. ICTCAacnC TSX A CICAA, A3AAGAAGIG OSAOSfiQCIC 4550 

QOraOGAAGC TOGiroCATr G3CrMCAAT GJGUIUUULT ATTOi T i njj 46OO 

.n,xi LiACGIG ICroiCATOC O3AaaG033 OGAIGTIGIC GlUjlUlUA 4650 

CXDGAIQCICr CAJGAC IGGC TTraomX ACTTOGACIC TGIGAaS^GAC 4700 

IQCAACAOGT GIGTCACICA GACAGTOGAT TTCAQDCITG ACOCmnT 4750 

^D^OCATIGAG ACAAOaaSC TOOOaaOGA TOCT U iUiUL- ASCaCICaAC 4800 

Q^^^CAG GACIGXAG3 GC33AAGX3«3 OCSmnMaC Ai ' i ' lUiUjCA 4850 

C3Cm333fiGC QCCCCKXX3G CATCnOSC TGG l CmiUJ 1CIGIGAGIG 4900 

CIAH»03D3 GXTCIQCrr GGIATCfiGCT CAQ30Ca3GC GA(30J0G 4950 

Tmo^mjG f£xxm cKiG AACAoxnas ajui ' iuumi ' gigocaggac 5000 

CA^I CTIGAAT Tn033AG3G OGflLTl ' ia OS G G O CICA CIC AaaaftGAlQC 5050 

ocAcrnrm tcocagacaa. aqcagsgios Q3Ac»AcnT a .' 1 ' iaa. ' iUi 5100 

13m3naGCA A33ZA1 XG'1U 1033213^3 CICAAOQXC TODDOCAICE 5150 

TO93A0CAGA 'lUiUjA AGIG TnGAIODX CITAAAOOCS^ O O C HJUA TOS 5200 

QOCAACAOOC CIGC'JATACA. GACIQ3QCGC: ' iUrHJA GAAT GAAGICAOOC 5250 

TI^CTCTOC AAICAOCAAA. laCATCAIGA eAlGCAiiUiU 033a3AaC3G 5300 

GA0C3IO3ICA a3AQCAOCT 3 ajiUL'lLUi'i' (3333333102 TUGJiUL ' iLT 5350 

Q^^^^^^^ TQOCIGTCAA CAOGCIOCT QGnCATAGIG a3CW33AI03 5400 

'i ^i i^i^JU sJ GAAGOCGLCA AmaMCIG JOOSSAOGT TCICUmvS 5450 

GAGITCGAIG AGAIGGAAGA GflQCICICAG GACnaOOST AGAiaaOA 5500 

AQ03A3^G CIOSZTGAOC AGITCAAQC?. <Sr'J 03 3X' l C QOCCICCIXX: 5550 

AGADCQOGnC 0030CAIOCA GAGSTIMCA CQlL-'lUL'lUi' OGAGADCAAC 5600 

■roa^^^GAAAC TDGAGSICIT TIO3303A;fi CACAIGIOSA AmCATCAG 5550 

TOGGA-DCAA TACTTOOQG QOCIGICAW: GL'lU.L'iyLrT AAOOOCGLXA 5700 
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TZGCnCKET GAIGQCTnT TCAdCSGOOC fdMOXr 5750 

Ggoc3tftfm: ' icuiui ' iuA aa m'ioaGa ojuiuumu s ciojULa scr 5800 

CUXGHiriT C m UCLUL 'JA CiUUL'mUi ' aGCUUClGGC CraOCIQSDS 5850 

CX3XCKICX3G CftGCUi'iULA CIQ3QGAMG 'lUL'iUilUGA. Ca i'lLTlULA 5900 

<JLXJim\3336 03Q9CX3IG3C G33a tjL ' iUiT GraGCATZC^ A3AIC»!n»G 5950 

CUL^IGAGQIC OnCCmSS iOSADJIGGT Ca KIUiUL ' iU aXXXXKICC 6000 

TcroQcnos Axaji'iuiA uiauiuxuu 'iciimm: AnnoGOQc 6050 

aJDU ft OJi'l G GG0CX333CX3\. GQSgXAGIG CAATOSmS^ ^0393:^3^ 6100 

Axxnroax lamaaoGA. AULmumc cxxxaaazftc TaaGioaaas 6i50 

A30GC3Cm32 GICACIGGCA O L' ICA C' iU i A 6200 

AOOCftGCiaC IGAGOGSO* GC MUAGlULi Aaaft X ' lUGS ASKSmOCS^ 6250 

IQSIGCIOC Ub'i'iUL'lUSC 1AM3QGACKr CIG33ACTG3 i\3MGGSAG6 6300 

T3CIGA003A CTmAGADC TQ3CIGAAft3 CGAflO^GKr G0C3OACIG 6350 

CCIGQGATIC OCrrroiGIC CIQCEA03QC G3GIKIg033 QGGT C 'iUULG 6400 

i^OGAGACGGC Ai'iMULACA. Croa JiGO C A . cmiUG ftGCT GAGAl LA C iG 6900 

GACAIGICA^ AIGAGGAICS ' lUmiLCia G GMJL'iUJKC 6950 

AACATOIGGA GflGGCaCGIT (DOXfiTEAAC GOCECACXS^ caGGCaJC'lU 6550 

TACroOOCIT CCT30QCX33A ACEaS^AGIT (JULUL'lG'iGG AUULJiUiUiU 6600 

CAGAGGAAIA GGIOGAGAaA AG30333IQ3 Q33ALT1UCA. CEALUimULi 6650 

OGrraiGACIA CIGACAAaXIT TAAAIGCaE T032AGAIGC OOCmXGA. 6700 

AlTiTlLACA. GAAnOGAOS GGGaGCUG L T ACACAQSTIT UUUULUULTr 6750 

QCAAUJUL'iT GnGG393AG GAQGOAICST TCAGAGTAGS ACIGCAaSAG 6800 

mXXX3GfI03 03I03CAATr AOCTIQOGfiG OCOSAAOOGS AOSEAQOOSr 6850 

GTIGADSIOC ATOCICACIG AJ OX HJO CA . TAaS^ACAQCA GAQSGGGCCG 6900 

GGAGAAQGTT GQOSAGAGQG TCA OJUULTi ' CEA I GGOCAG C iUCiLUXT 6950 

AGCXaOCror OCGCroCftTC 1CIC3»G3CA A LTIUCA OOG OCAACEfillGA 7000 

CTCOOCIGAC GOOGAGCICA. laGBGSCTAA CLlUJiUlUi -AQQC303AGA. 7050 

TQC33aQQCAA CA30aaG3 GTIGAGICAG AGAACT^AAGT G3IGATICIG 7100 

GAL'iULTlULi AiUJULTiUi' G02AGAGoW3 GAIGAGOGQS ;03ICICXD3r 7150 

AOCIGCAGAA M'lUlU0a3A. AGICTCQGAG Al'lCGOGQ33 GCLCIGGUGG 7200 

TCTQQQQQOS GOOgSACTAC AAOG0CCO3C TAGiaGAGAC GIG3AAAAAG 7250 

OCIGACTSDG AAOaOCIGr QGIC3C3aG3C 'iULmjC'EAC CAlJL'iUCACE 7300 

GIOJUL'iULT G lOX ' l CGCjL' CI033AAAAA. QOSDOXrEG GTOCICAOOS 7350 

AATTCAAOXT AlCrAL'iUJL' T l GGCUa^ ' i ' U yC A OCAA AAGTinGX 7400 

AGCroCICAA CrroOQQCKr TM3333CGM2 AAmXSACAA CAU.UL'iCiGA. 7450 

GOCXDOOOQCT TCIGQCIGOC OOOOOGACTC CXSAOGTIGAG TOCEATICIT 7500 

OCATODOQCX: QCTOSA0333 GAGCX7I0333 ATOOGGMCT CAGOSADGSS 7550 

TCAJIOGTCGA OSSICAGIAG TaOQGOOGAC ADQGAAGAIG TOGIGIQCIG 7600 
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CTCAAiiGicr aarrociGSA. cfiQ3aoacr ogicadoocxs tgogc'iuulg 7550 

AAGAACAAAA ACIQOCCfiTC AMI3CaCIGA. GC^ACIOGJIT GCIRa3a3T 7700 

CACAAICI03 'iUlM'lUJAC CACnCfla3C ASIQCnOOC AAflGGCRGAA 7750 

GAAAGTOOl TTIGACAGAC TQCAAGnCT GSMSOOCKT ISOGAGGAOS 7800 

ItSGOAGSl GBICS^AAGC?^ GGGGQGIC ft A. AAGIGAAGQC laA LTiUL ' iA 7850 

1D03iaGaG3 AAQCnQCaG 0CIG?O30aC OCTOTICfiG GCAAA3CEAA 7900 

GfrnOXTftT Q3C33C3^AAflG AOSIOOGTIG OCAIOXAGA. AfiGQODSraG 7950 

CXX30TCAA CICQOiGlGG; AAAGAOCITC TQGAftQOG IGB^ACRCCa' 8000 

AIAGACACIA OCAiCMGGC CAMSAAOGAG UiTi'iUiUOS TICAOOGIGTV 8050 

GAAG393QSr OSIAAGGCAG CIGSICICAT a^iUi ' iUUUL ' GftOCD3mm 8100 

TOOULUiUiG CGftGAAGRIG GCXXTOISaS ALUiUb'i'liAG CAAGOTODa: 8150 

CD33CXDGfIGA 103GAAQCIC CEft033ATIC C3\AiaCICAC CAGGfiCMCG 8200 

GSriGAAnC CIDSIQCAftG OSIQGAAGIC CAfiGAfiGAO: CXXSAIQQQST 8250 

'icixxjiMGA. mmaciGT TncftCfracA caGicaciGA GAOOGftCMx: bboo 

03iaa3GflQG AGGCAAirm CXaft tiUi ' iUr GftOCTGSaa: OOCAAOCmS 8350 

CG'IUUOJATC AAGiaX'ICA CIGAGAOOT TIMGnGQS G GO -U iU l 'lA 8400 

CXS^ATICAAG G333GAAAAC TOOQGZiaa: GCAOGflOOCG OQCGAQOQGC 8450 

CTACIGACAA CTAGCIGIOG TAACROXTC ALTlUUilACA. TCAAGaOCXDG 8500 

GGCAGCCIGT OGAGOGQCRG UGC'IOCAGGA CIGCftGCATC CI CGiUlUiU 8550 

ODSAOGACIT AGICSSmiC TGIGAAJOG CGSGISOTQIA GGAGGAOQGS 8600 

O3GAG0CTGA. GAGOCTICAC GGfi03CTAIG AXAGGnaCT eXBGCCOJUUL' 8650 

03332?£XXX: OCACAADCaG AMaOGACTT GGAQCTEAm ACA3CATOCT 8700 

CCICCAAOC?r GTCAGTOGOC CAOGftOGGCG CTO3AAAGAG GCSICDCTftC 8750 

CTiaOOCXnG A00C!ISO«C OXXriaXC 8800 

AAGACACACr CX»GICAATr CXnOOOMG CAACaHAMX: MGITIGUOC 8850 

CCACACroiG G3C3GAGGA3X3 ATACIGAIGA CUJAilTlL ' lT UmSIOCIC 8900 

AIAGOCAQQG AICfiGCinGA ACftQGCICIT AACIGIGRGA TCCmaGC 8950 

CIGdJCroC MaGAADCRC TOGftTCEO: TOGAATCRTT CAAftOOOC 9000 

MG30CICM3 CXSGATmCA CJXJCACAGn iOCICCAGS TGAAAaXStffl' 9050 

AGC3GfIG9CXI3 CSOGOdCfiG AAAACTIQQS G IOXOOX T TQCGMCTIG 9100 

GAGACA00CX3 GC30093fiGCX3 TOOaOXTAG Q L 'l'i C i UlCC AGAGSAGCrA 9150 

GQGC'IGOJAT MGIGQCAAG TAGL'lUi'UJA AL-'mJOCSGT AAGAACAAAG 9200 

CICAAACICA. CI03^AIAGC GGCXDOCIGQC CQQCTGGACr ' iUiLUUJi ' iU 9250 

GnCAGQQCT GQCTRCAGOG QQ3GAGACRT TDMCACAGC GIGTCICAro 9300 

CCC030CCCG CnOGfTTCIGS TmOOCTAC 'lOJlULUJGC 1GCAG33GTA 9350 

G3CAICIACC TOCTOOOCAA C03ATCAAG3 Tro33GT3^AA CACTOOOOOC 9400 

ICTTAAGOCA 'i'i'iLL'iUi'i'r 'ri'i'i'i'i'i'l'i'l' ' ITi ' i ' i ' i ' i ' i ' i ' i ' ' i ' i ' i ' i ' lL ' i ' i ' i ' i ' 9450 

TrnTncrr TocnTOCTr gri ' i'iTi ' iLu 'iTici ' i ' mL cxncrnAA 9500 
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MSIMEKFQRK TKFNIllRE^ UJKFB333Q1 VSSf^OU^ (SBU^/BKIR 50 

KISEPSQWG RR0PIFKABR PBSROIPC^ YOMIilSMS 0G»«O4LI£P 100 

PGSRPSWGET DERRRSRNLG KViUJUlUGiF AClMSyiPLV GAED33AARA 150 

IAH3VRVLED GVNBma^LP QCSFSIELIA LLSCL1VPA5 AXQVRNSSGL 200 

•m/INDGEMS SrVXEAADM: lilL'KC V K V RB3NASRCW7 AVIPIVMBD 250 

GKUTTQei^ HTnrfMSag lJCSALy\^aZi OSSVFIX/GQiL FIFSERRHffr 300 

7Q[xn::siyp GHncsiFMTaf osdCftifjSEaA alaa;!aqlii^ FQ^omiD^ 350 

^lH/tJJLfGIAYF3:t73mKVU^nI^^ 400 

OLLTPGaKOT IQLINIKGSW KCNSIMIOJ ESLNIQS'JLAG IFYQHKENSS 450 

QCPEHLA9CR RL1EE2^Q3WB PISYRNGSGL DERP30«YP IKPCGIVPAK 500 

SVaSPVYCPT PSPWWSnD R3C3APIYSWG AMiJIEWFVIN NIRPEDGNWF 550 

QCIWMNSIGF IKVOS^EPCV IOGVOWIHj CflDCERKHP EftlYSE^DGSG 600 

FWITPRCMVD YPYRLMnPC TrNmFKVR MifVGGVEHRL EAAOSWIPGE 650 

Brnr.KnRi3^ ELSPLiiSir qwc^/lpcsft upaisigli HLHasnvrM? 700 

"ifLYGVGSSIA SWAIKWEWV LLFIiIAI»R VCSCLI'^MLL ISQREAAI^N 750 

LVUMAASIA GIH3Q/SFI^/ FFCFT^WmG FMVFSV\/^ Y(Sti1PLUIL 800 

IALPQRM3\L DTEVAASOOG WLVGLWALT I£P!i:YKRYIS WCWWfl/DSYFL 850 

IKVEAQLHVW VPELWVFGC31 EKVUHCW HPTLVFDnK LLIAIPGPUW 900 

IljQ?^SLIiWP YFVRVQGmi ICAIARKtAG C3nVQMAIIK LGMiTGTYW 950 

IJHLTPLED/Jk HNGLPDLAVA. VEPWFSRME IKLTXWSADT AAOSDIINSL 1000 

IVSAPRGQEI llGPNXIfJS KGNRII^n A^T^QQIPGUi QCinSClGK 1050 

EKN3VEGEVQ IVSIMQIFL ATCDCWCWr VYHSAUlRL'i ASFKGPVI(J1 1100 

YINVCODLVG WPAPQGSRSL TPCIOGSSIXi YLVTRHAEVI FVRRR3DSRG 1150 

SLLSERPISVr LKGS9QC3PIIi CEAGHAV3LF RAAVCIR3WA. KAVEFIEVEN 1200 

DSTEMRSEVF HMSSPPAVP QSPQWRHLHA PIG9GRSIKV PAMfAftQGBflC 1250 

VLVEMSWAA HiGFGAXMSK AHSWDENERT GVRl'iTlLSP ITXSnGKEL 1300 

ADG3C93GAy DIIICEBCHS TDKTSIUSIG TVDDQAEEfiG ARUA/IAiaT 1350 

PPGSVIVSHP NIEEVALSrr GEIPF'W3vAI ELE7IKQC2« LIECHSKKKC 1400 

mAAKLVAL CTNAVAYmS m/SVIPrSG EWWSIDRL MiUb'iULtLlS 1450 

VIDCNICVIQ IVreSLDPIF TIEnTLPQP AVSRTg^RSR 1C3^3CPGrYR 1500 

FV7\EGERP9G MTOSSVLCEC YDM3CNfKEL TPABITVRIil AJM^TPGLPV 1550 

CX^HLEFVEG VFIGL1HIDA HFLSQTKQSG ENFPXLVAifQ ArVZARAQAP 1600 

PPSrCgyWKC LIRIiCPIIKS PTPLLYRDSA VQSEVILIHP riKXIMiaiS 1650 

ADLE^A;ISIW VLV33VIAAL AA»XSIQCV Viyom/LSG KEAIIPCREV 1700 

LYQEHDEMEE CSQfiLPraQ GM4EABQEKB KALCUJQIAS FHftEVCTPAV 1750 

grnjfljQKLEVF WAKHM'^INFTS GIQYIAGL5T LPGNPAIASL MAFIJAVTSP 1800 

LTIGQIUiFN ILJ33iflWAAQL AAPGAATAFV GAGLAGAAIG SVC3J3KVLVD 1850 

ILAGYGAGWA GALVAFKIMS GEVPSTE3XV NLLPAILSPG ALWGWCAA 1900 
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ILRRtMSPGE Cf^JQfM^BLL AEASR3NE^ PnOVFESm AARVmZLSS 1950 

LIVIQLLRRL jgWESShl.Tl' PCSGSWLRDI WDWICEWCSD FKTWLKMOM 2000 

FQLFGIFFVS Q^^GUPGVNR GDGBdHIPCH OSVEXZGB/K NGQURIVmi 2050 

ICRNd^SGHEF FINKmGEC TPLEABnKF AUHFMSAEEJf VEIRPM3XH 2100 

YVSGMTSCNL KCK3QIPSFE FFIELDGVRL HRET^FFCKPL IPEEVSFFM? 2150 

LHE^^SSQL HJhPEtlWAV LTSMDIDPSH TIAEAAGRRL ARSSPPSMAS 2200 

SSASQLSAPS IKaiCEaMD STOAELEERN III^RQEM3GN i 'lK V ESENKV 2250 

VnCGFDPLV AEEDBE^EVSV P&PTTPtfgpp E2VRALE^A^ lOOIPPLA/Er 2300 

MKKPLVEPFV VHXELPPFR SPEVPPHQOC RIWUZESTL SIMLAELMK 2350 

SFGSSSnSGI TGOnnSSE PAPSGCPFD6 EVESYSSMPP IrfdAHlMJL 2400 

SDGSWSIVSS GADIECWVOC SMSYSWIGAL VrTCAAEECK IiPINAI£NSL 2450 

U^HHNLVYST TSRSAOCK^ KVTEIFLCVL DSHYQCWIKE VKAAASKWKA 2500 

NLL2VEEACS LIPEHSAKSK FGYCSm^ HAPKAMAHIN SVNKESJLEDS 2550 

Vl'PUUiTi^ KNEVFCVQPE KQGRKBARLI VFRIGJECJC IS^SNLXDJVS 2600 

KLPE^AAGSS 'YGCQZSFQQIR VEFI^^QAMKS KKH^IjFSYD ISCFCGIVIE 2650 

SDlKi'EliAIY QOCUDPQftR VAIKSLTEPL YV3C3PLTMSR GEN03XRRCR 2700 

ASGVLTTSCX; ^^[IiTCYIKAR AACRAAGKP CIMLVOGCCL WICESAGVQ 2750 

ECftASLRAFT EftMERYSAPP GDPPQPEYEL ELTISCSSNV SVAHDGAGKR 2800 

WfYLIRDPrr PLARAAWEIA RHm^lSNLQ NIIMEAPIU^ AFMIIllIHFF 2850 

SV/LIARD^ QALMGEIYGA. CYSTEPIILP PIIQRIH3LS AESIflSYSFG 2900 

EINRWAACIR KLGVPELPAW RHRARSV/RAR LLSRQGRAAI CX3QIfN&«7 2950 

RIKLKLTPIA AAGRLDLSQW FTAGYSOGDI YHSVSHARPR WFWFOLLLLA 3000 

AGMSIYLLfN R 3011 
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QCXSmXXr 1GATO93QCX: GftCfiCiaC3C 50 

QGAACUOG ' ILTIC AQGCA GAAAQOGUCT MSXPITGCCG TTOGIKKaG ICQ 

TCiOSroCRG 0CroCftG3ftC aXOXICOC GOSftGaGCXA TR Ljiu^lLlG 150 

aSGAAOQQGT GAGERCR0CX3 GAflTIQCiaG GAaSftDOQQS 'lUL'iTiLTiU 200 

GftTCAACXXG CICAMGOCT GGftGRT37IGS GLUlUUUCq: GCX3aGft CIGC 250 

mxxxsMsiA ui u r iGGGjic oaaAfioaoc tigiqgito' GociGRaaoG 300 

GianTGQGA GTCIGOXGQS AGGICKCTA GAUUb'IGCAC CKTGAGCAaS 350 

AATCCEAAAC CICAAAGAAA AADCAAftOGfT AACACCAAlX GCXXSOOCftC^ 400 

GSAOglCAAG 'I'lUXGSGOG GTOSICftGftT c m ' lU i:j i Q3A GTl'mTIUl' 450 

1G00GCX3CAG GGC3CXXXaQC3 TlU LjUiGIGC GOGOGRCTaG GRftOXTIOC 500 

GRGCagTOGC AAGCIOC3IQC5 AAQQOGRCAA OdMOO^ AAJJlOjjJi 550 

ACmSftQQGC ^399CXnQQG CICflGOOOQG G?I30DCTIG3 OCDCTCIKIG 600 

GCAMGAGGG CCIGOOGTOG GCAQSOQQC TOCIGICMX: GaOOOQCIGC 650 

CX3QCCEftGrr GQGQQQCX»C GGADCCXXGG 03m33I02C CTftftCITOGG 700 

TAAGGICA3C GATACCmA CAIGCJQQCIT C3QCXX3A3CIC AIQQQGnaCA 750 

' i'lOlGCTCCT UGGCGOOOGC CTRGQQQQOG CIGOCAGQGC CTIGGCACAC 800 

GGIGICCQQG TICTOSAGGA CGCaQGIGAAC TMQCAACftG QGAfiCT IGCX: 850 

CGGITOCnCT TICICEATCT TCCICTIQQC TCIQCIGIOC 'iUri'i UftOCft. 900 

ICCCAQCnC CCSCTDmSA GIGOQCAAaS TGflGaQQGRr MftO^CGIC 950 

AOGAAOGftCr GCroCAACIC AfiQCKTIGIG TMGRQCX3«3 CXSGfiOCT^ 1000 

CATQCATACT C3CXX3GGIGOG IGU U L' IUlUr TCAGGAGGGT AACAQCIDOC 1050 

GTIGCroGGT AQQQCICACT O0CACX3CTCG CX3QaCAGGAA 1Q0CAGC3GTC 1100 

CCCACEACGA CAATAOGftCXS OCAOSTCGAC TIUL'IUJITG QGACXXSCIGC 1150 

'ITiUlGCICC anAlGTACG TQQ333AICr CIQCXSGftlCT ATITI0CICX3 1200 

ICICCCAQCT GTICAOCnC TOQOCTOQCrJSQCMGflGAC AGIQCftSSfiC 1250 

TOCAACroCT CAATCEATOC OQQQCATOIA TCAGGflCACX: GCAIQQCTIG 1300 

QGATATGATG ATGAACTQCT CMJZIKMC AQOOCEAGIG GTGIOSCAGr 1350 

TOCTCCQGAT CCCACAAGCT GTOGIQGftCA 1QGIGQ0QQG QQOQCACIGG 1400 

GSAGrrccIQS CX3Q3C}CTIGC CIACEATIOC A3GSIRQQGA. ACTGQQCIAA 1450 

QGrnCIGATT GfroOOQCIftC TCnTGOOQG GGirGAOOGS GAGft CXXaCA. 1500 

CX3ACG3QGAG O3IG030G3C CACAOZACrT 0Q93G?rrcAC GTOOCTmC 1550 

TCAICIQGGG CGTICICAGAA AAaOCAGCTT GIGAMaOCA AOSQCAQCIG 1600 

GCACATCAAC AGSOQGCX: TAAATIGCAA T5A CTO0C IC CAAACIGQGT 1650 

Tcnroacxac GcromTAC ocACACAAGr tcaactogic cxjugiqcocg 1700 

GAGCGCATQG OCAGCIGOOG CXXCATIGAC TGGfnOQOOC AGGGGnG33S 1750 

(XCCATCACC TATACIAAGC CEAACAGCIC QGATCAGAG3 OCITATIGCT 1800 

GQCATTAD3C GCXnOSAOQG 'IGTGaGICG TACOOQOCSIC QMSIGIGT 1850 

GCnOCAGIGT ATIGinCAC OXAAQOOCT GTIGIGGIGS GGAOCACOGA 1900 
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' lUJi 'lUUGgr GfTOOCmDGT M30na3QG QGaGAAIGRG AOGAOTGA 1950 

TOCIOCICAA CMCta303t (XX3CCl£MG QCMCTSSTT UUUL'lUiftCA 2000 

-lOGAIGAfiaA CjiiftL'lUULjlT CACIAftGftCG IGOQGftOGflC (J UUmUj-CftA, 2050 

CA'iUGGaaGG GICXX3IAA0C QCAOCTIGAT CmCCCCACG GfiCIQCnCC 2100 

OGAAQCACXX: GGftQGCTOCr TRCACAAAKT GIG3Cia33G CJUUL'iUJi'iU 2150 

ACaC3CTftQC3r QOCiaGiaGA CiaCOCMaC A3QL'mUUL' AC m CCO CT S 2200 

aoCTCAKT 'i ' mUL:A!l CT TTAAGSnaG GfilGIMQIG 0993938108 2250 

AQC30Q9CT <3^MX3Cn3CA TSCSWOTOCSA. CTOGROSftGA GGGdUrA^ 2300 

TIGGRQGfiCA QQGAnaOSIC itfSMOCAQC (JLUL'iUJlU: TSICISO^ 2350 

AGAGflOaCftG AIRCIG30CT GJlULTi'lCftC OOCTODG O-'iTmULA 2400 

CTOGfrnGKT (XAICroCAT CftGftAC2mD3 1GGR03IGCA ATftLXJlLJi»C 2450 

GGnGERGGGT CAO Um ' l ' iGl ' CiULTl'iULA ATCAAMGQG AGIRCMGCT 2500 

UilULTmU CTIUIC C X G G . CAGAOSaXXS OSIGIGIQOC TQCITSIQGA 2550 

TCATOCIQCr GMSGODCAG GCIGAGQC3QG OCTEftGfiGAA L'l'iUJlUUiC 2600 

CICAMGOQG OSroOGIQQC 03C3AG03CAT Os'lMICICT OJiTlLTlUr 2650 

GnCnCTQC (iJUUCC'ICiG,T ACMTAAQQG CaQQCIQQCT (JL'iUUJJUUG 2700 

QGIATOCnr TIMGGOGIA TGGUUUL'IQC 'lUL'lGC'IOCT AL'lUUUbTlA 2750 

ccAccnccaG cnaoQCETr OGAOoooGftG MoacrxAT asroaoQaoG 2800 

TOCG b ' i ' IUiT GEftQGICIQG TATICTIGAC CITGICRDCA TACIACAAfiG 2850 

IGmCICftC TAGQCTCAIA TOJlUJi'ilAC AATftCnTAT CACCAGftGOC 2900 

GAOQOQCACA TSCAAGflGIG GO'lCCCOGOC CICAACXSTIC GQ33ftQQQCX3 2950 

CGAT3CXMC MOCIGCICA ajlUlGOGSr TCKTXftGftG TIAKnTnG 3000 

ACATCAJXAA AL'IUC'iGClC GOCKEACICG G0CX3QCICAT GGiGCIOCAG 3050 

GCIG3CAIAA CX3AGAGIG0C CTftCTTOGIG OGCJGC'ICAAG GQCICAnOS 3100 

TOCATOCAlG TI1AGK303AA AftGIOQOOQS Q3GICATEAT GKr&aaiQG 3150 

TCTICftlGAA GCTO930G0G CIGACAQGfIA (DGIADGTnA TAADuAICIT 3200 

ACCCCACIQC QQGACIG9X OCaCXaOOOQC CIMGftGADC TIUCUanaGC 3250 

GSERGW300C mOJiUi ' lUi ' CXDQOCSOQGA GAOCAAGGfIC MCADCTO3S 3300 

GAQCAGACAC a3CIQ0GIGr GQQGftCftICA TCnGQGrrCr ACXXGBITCC 3350 

QQCXDGAAGQG QGAAGGAGAT A'iTl'l'lUOGA C033nGAIA GlCIOSAftGG 3400 

GCAAQQGIQG OaACIOCTIG OQCDCftTCAC QQOCTACICX: CAACAAftOQC 3450 

QQQQOCSrnCT TQCSnOCATC A3CACIRG0C TC30GQCXX3 G GRCRflGaftC 3500 

CA0C3IDSAAG GOGAGGmCA AGIQGTnCr AiDOOCAACAC AAIlLTi'iULT 3550 

GQOGADCIQC T'ilCAAOQQDG TGIQCTQGAC IGICT^JDCAT G903CTO9CT 3600 

(DGAAGACXXT M3C033TCCA AAAGC3IOC3A ICAOXAAAT GDVCAOCAAT 3650 

GIAGAIXTOG AOCK3SID3G CIGQCAQQ03 G0000CX3QQG GQOQCICCAT 3700 

GACADCATOC AGCIGTOXA. QCTOOGAOCT TTRCTIGSIC ADGAGACA3G 3750 

CTGATGTCAT TOOGSTXXSC CQQQGaOQOG ACAQCAQQQG AAGTECIftCIC 3800 
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TCXX30CAGQC OCDGICIOCIA (XIGAAAGQC TCCIC333IG GIOCATIQCT 3850 

TIG UUL 'ITO S ajCJCftOJlU S •iUiJLXJLVLT CD33X1GCT GiGiOa OCC 3900 

G999GGIICX9C GftAGQOQSIG GACTIGAilAC OCXSTIGAGIC TA!IGGAAACT 3950 

AUJAJGCOGT CKTOSICIT CACRGACAAC TCAAOCOXC CX39CD3iaaC 4000 

QCAGACMTC CAflGflGQCAC AICIQC303C TCXIDRCIGQC AXG3C7«GA. 4050 

QCaOCTiAAGr GOCXJX'lUm TAIGCftQCXX: AfioasnoA OSIOCiaQIC 4100 

ciGAftoooGEr (joji'iuDum: eaocmaGas ' mujLUjg r MaroioaA. 4i50 

GQCTOmsr ADGGAGOCTA ACKICAGAAC IGSSSIAAGS AOC?m»GC^ - 4200 

0Q33G9XTC CATDUDGTAC TOCAOCTRIG QCAfibTlULT TOCaSmST 4250 

GGCronCIG GGGQGQOCTA. IGACATCTOA AajmSIGATO AGIQ3C30C 4300 

i^ACIGACIOS ACXMXAICr TQ93C3^I03Q OOGIUCIG GACXS^AGOaS 4350 

AGaOjLJC'lU a A3CX9CX39CIC GJUSIQCICG CCMXXSCmC AOC'IUXUL^ 4400 

' iLUaT-Lift oos iGocacftooc cRftaaroaflG GRAaaaoacx: TsrocAACAA. 4450 

T3C»GACaTC CXXnCIMG QCAAAGdC»T aCOCAl'lGAG QCCATCAAGG 4500 

GQQGGAG3CA TCTCATITIC TOOGATTOCA AGAAGAAA3G IGAOGAGCIC 4550 

GCX3QCAAAQC TGACAG3GCT OQGACIGAAC QCIGEAQCAT ATjaODQQQS 4600 

OCTIGAIGIG TO03ICATRC OJOJIMUGG; AGAOGflDSIT GfTOGIQQCAA 4650 

CAGAOQCICr AKIGADQQGT TTCADCXSQOG Al'mUACIC AGIGKTOGAC 4700 

TQCAAIRCAT GflCflCACXDCA GACAGTOGAC TICAQCTIQG AaXXX3^CCTr 4750 

CAaC3VIT3^ ACJGACGACXX; TQOOOCAAGA (DOOOGflGTOG CGCTOQCAAC 4800 

OGOGAOSIftG AACIQQCAGG GGE^QGAGIG OCAIICIACAG GITIGIGACr 4850 

OCAQGAGAAC G3CXXTO93G CATCTrDGAT TCTTOOGIOC TCflGIGAGIG 4900 

CTA3GADQ03 GG L'IUIUL ' IT QGflMGAQCT CAOCm33CT GAGACCTOQG 4950 

TTA Ljb ' I ' lU U G GGCmCXITA AAIACAOCAG (JJi'lUCXXXST CIQCCAOSAC 5000 

CATCIGGAGT 1CB33GAGAG CGICTICACA GaCXnCAOCC ACAJERG?flGC 5050 

OCACnoriG TDOCAGACIA AACftGaCAQS AGACAACriT OCmCETOG 5100 

TGQCT^'ICA AGCIACMJIG TGOGOCAGQS CICAAOCICX: AOCTOCAIOS 5150 

TOQGACCAAA TOTOGAAGIG ICICKDmS CIGAAAOCIA CACIGCAOSG 5200 

GCX^ACAlDCX: CIOIIGTATA GOCHTOSAQC 03IDCAAAAT GAGGTLCA^IOJ 5250 

TCACACADCX: CAaSVACTAAA TACfilCAIQG CAIQCA3GIC GGCIGAOUG 5300 

GAQGIDGICA CmxpCCTG GGIGC'IGGTA GGOOGAGICC TIUJAGLTIT 5350 

GQOOQCAZAC T3UL'ilaA0GA CAOGCAGIGT GCflCAil'IGlG GQCAGGAICA 5400 

' lUl ' lUiULULi GAAGOCAGCT GlOJi'lUULG ACAG33AAGr OCTCIAOCAG 5450 

GAGITCGAIG AGATOGAAGA G'lG'IGUCICA CAAS^SSTT ACA3DGA0CA 5500 

GGGAATOCAG CIOGOOGAGC AATICAAGCA AAAQ0CX3CIC GQGnGTIQC 5550 

AAAOQOCCAC CAAGCaAOOG GAQQCIQCIG CICOOSIGST GGAGIOCAAG 5600 

IX3GCX3AGOOC TTGAGAOCTr CIQQ3CX3AAG CACAIGIQGA ATTICAICAG 5650 

OQGAATACAG TACXHAGCAG GCTEATOCAC TCIGOCIQGA AA0C3G0GQGA 5700 
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a3iQC3flCKET GMOQCSflTT JOULTIUiA TCftCEftOOOC OTICROCftOC 5750 

CAAAfiCaOOC TOCIGimA. C AILTlUJm Q3?flX33SIQG CiGOGLAftCT 5800 

aSCroCICDC AGOGCIGOST CAQCITICGr 0330300390 AIOG0a3GfiG 5850 

OJ U L ' lUri GG CAQCftlRGQC CnGSGAftGG 1QCICX3IG3A CKICnQQCG 5900 

ojL'i m oaGS aasasmx: oggoscacic uiujul^itia ;^3GiiCKrGM3 5950 

gjCJOaftGSIG aX'I02ftCXX3 AOGAUL'ICGT CAACTSOC (X'lGGCATOC 6000 

'lu i L'ic oas ' max ' iumc o'lLmjiJiCG auiuxoysc AMracioosr soso 

OQQCaOSiaG GaXG3GAGft. UU U UUL ' IG I G CAGIGGMGA. AD03aCIGKr 6100 

AGOSTTOQCT T3903C3Q3IA ADaOSICIC aX!ia03CAC OMUiUUL'lU 6150 

ASmSAOQO 1QCM3CMX3r GICXnnCA 'lUL'iC'iCERG CTTEAOCZ^ 6200 

j£n!CM£nSC TGfiM3a3Xn^ 6250 

QCCA3GCTOC QaCIOSIGQC 'DifiQ3GMr3r TIGaSftTIQC; AmiQCSmS 6300 

TCTIGftCIGft. CnCAACaDC TOaCTXAGfT OCAAACICCT GaDQa3GnA 6350 

COQQSAGKX: CITIOCIGIC A3QCX3\AOQC G9SEACAfiGG GAGICIOC303 6400 

GGQGGAOSGC ATCAIGCAA^ a a CCTCOX AIG03GAGCA CAGKIOQOOG 6450 

GACATOIC2A AAfta9Sna: AIGAOGKrOS TAG930CEAG AAOCTQCAGC 6500 

AACAOGTOQC ACGCS^AOSTT (XOCflTCAAC QCM3O03^ a39GAOCTIG 6550 

CACACDCTOC (DOQQOQCIXA ACIMTOCAG QQOXnMQG CX3QGfIQQCIG 6600 

CIGAOGT^OA GSIGGAQ3IT ACXXGIGIGG QQGKITiULA CIAGSZGACG 6650 

QGCPOXSCCh CIGACAAOGT AAAGTOOXA 'lUULftLUi'lL' aaaOCXXXG R . 6700 

ATICnCACX; GAOGflOGATC GAGflOaSGIT QCACftQSIAC CCKXXXXXST 6750 

GCAAftOJICT TCTACX3GGAG GAOGICAOGfT TOCftOGTOOG QCICAADOA 6800 

TACriGSrOG GGTOaCAOCT OOGAIQOSftG CXX3GAAGCX3G ADGTEAftCAGr 6850 

(JLTJLftLTlUJ MGCICftODG ATODCKX3CA CATIftCAQCA GWSALUJL'JA 6900 

AGOSIftQQCT QQCTAGftG3G TCiaXOXrr CnTRQOCftG C3C3aCftGCT 6950 

AQCJCAGTICT CIGOQCXnTC TITCAAGSaS ACAHQCftCIA CXXaOCRTCA 7000 

CiaDOQQGAC QCIGAOCICA laaQOOSA aCTCrrSIGG OaOCAGSAGA 7050 

TOOOOSGAAA CATCAdOSC GIQGAGICAG AGAKD^AGGfT AGE^ATICIG 7100 

GACICnrOG AACOQCnCft CXaOSGftGQQG GATCBGfiQQG AGATKTOOGT 7150 

CJQOQQOaGftG AT0CIQO3AA AMCCRGSAA Ul'lUJUUlLA OJUi'lUUHA 7200 

TAT03QCACX3 CCCGCMJIM: AATOCTOCAC TOOIAGftGIC CIQGAAG3AC 7250 

CXX33AC]3^ ITXCICCGOT QGnACAOaGA TOOXaTIQC CftDCIWDCAA 7300 

QQCICOCCA ATftCCAOCIC CAOSSAGAAA GAGGaOGGIT G'lOJlGACAG 7350 

AATas^flror Grcncioa: tiqqoggrqc toqocrciaa galx-tilulji' 7400 

AGCnOOQGAT CCBBGOCOjl' TGKERGOOGC AOQQOGAOOS ODCTICJCrGA 7450 

CCIt33QCia: GAOGAOQGIG ACAAA03ATC OGADGfTIGAG IDGERCICCT 7500 

CCATOCXXDC 0CTIGAAG3G GAGO0G3GQG ADOOaSATCT CAGOGftOaOG 7550 

mnOGICIA CCGTCiAGIGA. QGAGGCrAGT GAGGATCTOG '1CIGC1QCTC 7600 
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fATGICCmr ADSIGGRCaG Q030C3CIGKr CROSOTIQC G L'iUJm flGS 7650 

AAAGflftfiGCT GOOCftTCRftC CXUPIGRGCA AClLTi'lULT G33ICA0CAC 7700 

AACMQSICr ADQOCACAAC AiaOOQCaQC QCAAOOCIOC G3CaGAAGAA 7750 

Ggiea ULTlT GACaGAnGC AftGItXTOGA IGAICATEAC 033GftOS iaC 7800 

TCAfiQG?iGKr GZ^ftQQOGAftG QCJSIOCSOG TD^QOaSA GLTilJimUi' 7850 

MftGRQGMS UC' l GC ft ftGCT G ft LUOX TO CaiTOOQaCA AftDRXftAftTT 7900 

IGGCERTOOS QCaAftGGROS 1Da3GAMrT MOCfiQCaOS GOCJSmAOC 7950 

jOflCmifIC C)GTOIG9G?«3 GACTIQCTO 8000 

GACftOSmi TCAIQQCAAA AAGIGfiQ3IT TiUiUUGiXCC AAOCftGRG?A 8050 

QgGftGXaaC AAOOCftQCIC QOCTiaTOGr AITOXAGRC CIQQGftGnC 8100 

GTOIMQOGA CaAGftlGCaOC CTITOGGBOS ' i UjiCIOCaC OTianOG 8150 

GCXXncaaOG GCIGCICMA CXaSKmCAA TftCIOCXXXA AOCaOOGOT 8200 

CXSAGTTOCIG GIGAATAOCT QGAAMCAAA GAAAIGOOCT AiUUULTIUi' 8250 

CATA3GACAC 0CX3 L ' iUiTi'l ' GACTCAAOaG TCACIGAtaG TSOTIOSr 8300 

GTIGAGGAGfr CAATmOCA AIGTIGIGAC TIQQC3000CX3 AGGOCAG ACA 8350 

GQC3CAn3«GS TOQCICACAG AQLUULTl'lA CSmDQQQQGT OJUL'IUACIA 8400 

ACICAAAAGS GCAGAACIGC GUri !ft !ia30C GG;JG0 Cm3C AAGIGGOCSIG 8450 

CIGAQSACIA GCIGCGGT C AA TADCXILCACA TCmCTTGA A033CACTOC 8500 

■AGCCIGICGA GCIQCAAAQC 10CAGGACIG CAOSATOCIC GIGAA03GAG 8550 

ACGA UL ' I ' lUl ' Ojl ' IMUlUi ' GAAAQQQOaG GAADOCAQGA GGATOOGOaS 8600 

QCCCIROGAG OCnCAaaGA GQCIAroACT AQGranOOG GOOOOOOCOS. 8650 

G3AIOCXX30C CAACX^GAAT AQGAOCIGGA GCIGAIAACA TCATCflTOCT 8700 

CCAATCflGIC AGICX3CX3CAC GATOCAICIG GCAAAAQQC?! MftCEOnC 8750 

ACCCXnGADC C!CACCADOOC (XTIQCAOGG GCIQ0GfIQ3G AGACftGCiaG 8800 

ACACACroCA ATCAACICrr QQCTM3QCAA. TSOCAIlXaTC TKIG pGCOCA 8850 

GCCTATOGGC AAGGATCATT CIGATCACIC ACmTiUlL' CSOCCITCIA 8900 

QCICAAGAGC AACTIGAAAA AG0CCIG3AT TCICAGATCT AD3903CTIG 8950 

CTACTOCATr G^GGCACTIG AaTTACCICA GAICATIGAA OGACTOCAIG 9000 

GICrrAGOOC AmACACIC CACAGnTACr Cia3«3C3IGA GAJCAAIAGG 9050 

GIOacnCAT GQCICAGGAA ALTiUUUb'iA OCACCmGC GAACCIQGAG 9100 

ACATOGQQOC AGAAGIGTOC GOQCTAAGCr ACIGIOOCAG OC3G3QGAQGG 9150 

(XGCJCACriG TSXAGftTAC ClUl ' l ' IA ACT GQQCAGIAAG GAOCAAGCIT 9200 

AAACICACIC CAATOCXX33C OGOGICOC A G CIGGACTIGT CiULL'lLGTr 9250 

aJiUGCIGSr TACAQC033G GAGACATAIA TCACAQOCIG TCIOGIGOOC 9300 

GACXDCXX3CIG Ul ' l ' lLUUl ' lU T3C3CTACKX: TAClTiClUr AQQ93IM3GC 9350 

ATITAOCrGC TOQQCAACDOG AlGAAOaOOG AGCIAAOCAC T CCPGOCCTT 9400 

AAGCCATnc ciui ' iTiTi ' i ' Ti ' i ' i ' iTiTri ' TnTmnT TcmnnT 9450 

TTICTITOCT ' I ' iLLTlL ' i ' i ' r ' iTl'iLL ' i ' i ' lL ' TTITIOCXnT CnTAATOGTr 9500 
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GGC'iaCftTCT TMSCCCmsr CMJGCCmX: TCIGAAAOST OOGIGfiOaOG 9550 
CMX3ACIGCA. GAGAGIGCIG AaSOQQCCT ClOGCaGAT CATOT 9595 
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MSINEKPQRK 1KFNINRRPQ rWKFPQQ3QI VOGVYLLERR GEKDC3VRMR 50 

KASER9QEK5 RRQPIFKARR i=H3Wt©QPG YFWPLYGNEB LQ&CiQWUSP 100 

RGSRPSNGFT DERRRSFNLG KVlLUL'lUaP AEUOflPLV GAELQGAABA, 150 

I^m^FVUD GUNJOIIGNLP GCSFSimA U^CUnPAS fiXEJmJSCSL 200 

mTlNDCSNS SIVmACWI MHTPQCVPCV QEENSSRO^ AIIIHIAAFN 250 

ASVPimER HVEOiVGEftA FCSfiMYVSDL OGSIFLVSQL l-'Il-SHMifaT 300 

MJOJ^SIYP GHVSGHRMftW EMMgWEPIT ALWSQLU?! PQAWIl'WIftG 350 

AHWGWLAOA YYaiWGNWAK VLI\«LtEftG ViXaUlHTIGR VfiGHirSGFT 400 

SLFSSGASCfC IQLVNINSSEr^ HINRIALNCN IFS^HKFNSS 450 

GC3=1E3^MA9CR PIEWFT^QQWG PnYIKENSS DCRFTCWHC^ ERPOSWEaS 500 

C^OSEVXCFT PSEWWSnD PS3VPIYSWG ENEIUWLLN NIRPPQOgWP 550 ■ 

QCIVWNSIGF IRTOOGPECN IG3V12QFTLI CPIDO-KKHP EftTXlKDSSS 600 

IWLTFRCEAro •XPXRIJWffifPC UNFSIFKVR MYV33WEHE?L M^AQWIKE 650 

lOCEEi^ERS ELSELLLSTT EJOQIUCaPr TLPALSIGLI HEH3PJIVDUQ 700 

YLYGVGSAFV SFAIKWEm, LrjIUAEAR U-'AfTMr^.T. lA^ffiAALEN 750 

LWLNAASVA GAH3ILSFLV FKJ^ftWmG RIAPGAAi!AF vnwj PT tut. gOO 

LALPERAYAL EFEMAASQQG AVLVGLVFLT LSPmVFLT RLIWWLQin 850 

IRAEAHD^WW VPELlMOai EftllLLTCAV HPELTPDIIK LLLAHGELM 900 

VU5AGITRVP yFVRAQGLIR ACMLVRKWAG GHYVQMVFMK IGftLIGIWY 950 ' 

NHLTPLKCt^ HftGLPElAVA VEPWFSAME IKVnWGADT A?m3riLGL 1000 

PVSARRGKEI EDSPADSLEG QGWRLLAPTT AYSQ3IRGWL GCIITSLT3R 1050 

EKT'^JVBGE^ WSmTQSFL ATCINSW3fW VYBSAGSKTL AGEKGPnCM 1100 

YINWQLDLV3 W3APPGARSM TPCSOSSSCL YLVIRHAEVI P7RRRia:SE«3 1150 

SLLSPfUVSY mSSSQSELL CP3GHWGVF RAAVCIRSVA KAVDFIPVES 1200 

METIMRSJVF TTNSTPPAVP QrrFgWAHEK\ PIG9GKSIKV PAAi^AQGyK 1250 

VLVLNPSVAA TDGPGMfl4SK AHSIDrNIRT GVRITTTOGS riYSraSKFI. 1300 

ADQC3CSC3SMf DIIICEECHS lESmUGIG TVIDQAEIAG ARLWIA32RT 1350 

PPC3SVIVEHP NIEEIGLSrW GEIFFW3K7SI PiEAm3GRH LIFCHSKKKC 1400 

EELAAKLIGL GLMAVAYYRG LCVSVIPPIG EVWVTOTftL MIGFTCCFIS 1450 

VUXNICVIQ IVEFSLDPTF 'I'lEiTiVPCP AVSRSQRRGR TGRGRSCarXR 1500 

FVTPGERPSG MFDSSVLCK: "ilftGCAWYEL TPAETSVRLR AmOTGLPV 1550 

Og>JLEFV4ES VnGLUnm HELSQIRQftG TJNFEYUJKfQ ATWaRAQ?^ 1600 

PPSWDQM/«KC LIRIKPIIH3 PTELIiYPDSA VaJJEVjCLlKP riKYIMRQIS 1650 

ADLEWISIW VLVGGVIAAL AATCLTIGSV VI\O^ILSG KPAWEI3?EV 1700 

LYQEFIDEMEE CASQLFflBQ Q^IAEQFYQ KALGLLCTCRT HQAEAAAPW 1750 

ESKWRALZTF V\ftKHM^«JFIS GIQiflAGLST LPC2SIPAIASL MAFTASnSP 1800 

LTTOTIJUFN ILGGWWAAQL AKJSAASAFV GAGIAGAAVG SIGDGKVLVD 1850 

IIAGYGAGVA GALVAFTO^ GEVPSTEDLV NLLPAILSPG ALW/GWCAA 1900 
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UKRHyGECE O^JQ/^SMSl AEASRaJHVS PIHWPESnA ARRVIQILSS 1950 

UrnQ[Il(RL HQMINEECSfr PCSGS&OEV WDWICIVUED BKIWIjQSKLL 2000 

ERLEGVPELS (CT3XKGVWR GEGIMQnCP CXaQIfiOWK NSSMRIVGHl 2050 

TCaJIWHSIF KQKmGPC TPSEMIWSR AIWRMAAESf WIFW3DFH 2100 

WIGMTnUV KCPOtT^EftPE EPIBVEGWRL HR»KRCKPL IKEDJTEQJG 2150 

LNgnWGSQL ECEPEHVIV LTSMLilDPSH TU^EiaKRRL ARGSPPSLAS 2200 

SSASQLSAPS LKZtrCnHHD SEDREtJEaN IIWE^C^M3GN HRWESENKW 2250 

VUXSFEELH AE3CEREIS7 AAEIIKCSRK EPSRLPIWAR KKNEELIES 2300 

WKDED^VPIV VHXHJIPIK i^EEIEEERFK RTNAOflEaJV SSMAEUOK 2350 

IRSSSGSSSV^ ESSEKD^LED lASCDGCRGS EWESXSSMEP IHaEPGDEEL - 2400 

SD3SWSIVSE H^SEDWDCS MSYIWIGALI TPCT^AEESKL PINELSNSLi 2450 

PHHtWVXTflT SRSASLPQEOC Vlt'lKU yiD EffiREWIKEM KRKASIVKWC 2500 

LLSIEEACKL TPEHSAKSKF GKGAKEm^j SSRAVNHIRS VWEELLEDIE 2550 

TPinmWAK SEWPCVQEEK QCa^PARLIV FEDLCWRyCE KMftlMWVST 2600 

LFQMVM3SSY GPQKSEKQRV EELWIWKSK KCEMSFSYDT RCFESIVIES 2650 

DIRVEESIYQ CXHAPEARQ ATRSLISILY IQGPLINSKG QN33XRRCRA 2700 

SGVLTISOOJ 1L'ICai<?flA ACRAAKUgx: TMLVNaXLV VECESfiGIQE 2750 

r»AAIf5APIE AMIFKSAPK3 DPPQEEXIXE LTISCSaWS VRHDASGKRV 2800 

YYL1RDPITP IARA?*JEIftR HITOKSWDai HMSiMTIMA. KMIIMIHFFS , 2850 

niAl^QLEK AIOTnGAC YSIEFLDLPQ IIEE?LH3L£A FILHSYSPGE 2900 

INRVT^SCUaC UGVPPUOWR HRARSVRAKL LSQQGEVATC GRMJIMAVR 2950 

IKLKLTPIPA ASQIXLSGWF VAGYSQGDIY HSLSRARERW FPLX3JLLLSV 3000 

GVGIYUJNR 3010 
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SEQUENCE LISTING 

<110> Bukh, Jens 

Purcell, Robert 
Yanagi, Masayuki 
Emerson , Suzanne 

<120> Infectious cDNA Clone of GB virus B and Uses Thereof 

<130> 2026-4308PC 

<140> TBA 

<141> 2000-06-02 

<1S0> 60/137,694 
<151> 1999-06-04 

<160> 5 

<170> Patentin Ver, 2.1 

<210> 1 
<21I> 9399 
<212> DNA 
<213> GBV-B virus 

<400> 1 

accacaaaca ctccagtttg ttacactccg ctaggaatgc tcctggagca ccccccctag 60 
cagggcgtgg gggatttccc ctgcccgtct gcagaagggt ggagccaacc accttagtat 120 
gtaggcggcg ggactcatga cgctcgcgtg atgacaagcg ccaagcttga cttggatggc 180 
cctgatgggc gttcatgggt tcggtggtgg tggcgcttta ggcagcctcc acgcccacca 240 
cctcccagat agagcggcgg cactgtaggg aagaccgggg accggtcact accaaggacg 300 
-cagacctctt tttgagtatc acgcctccgg aagtagttgg gcaagcccac ctatatgtgt 360 
tgggatggtt ggggttagcc atccataccg tactgcctga tagggtcctt gcgaggggat 420 
ctgggagtct cgtagaccgt agcacatgcc tgttatttct actcaaacaa gtcctgtacc 4 80 
tgcgcccaga acgcgcaaga acaagcagac gcaggcttca tatcctgtgt ccattaaaac 540 
atctgttgaa aggggacaac gagcaaagcg caaagtccag cgcgatgctc ggcctcgtaa 600 
ttacaaaatt gctggtatcc atgatggctt gcagacattg gctcaggctg ctttgccagc 660 
tcatggttgg ggacgccaag accctcgcca taagtctcgc aatcttggaa tccttctgga 720 
ttaccctttg gggtggattg gtgatgttac aactcacaca cctctagtag gcccgctggt 780 
ggcaggagcg gtcgttcgac cagtctgcca gatagtacgc ttgctggagg atggagtcaa 840 
ctgggctact ggttggttcg gtgtccacct ttttgtggta tgtctgctat ctttggcctg 900 
tccctgtagt ggggcgcggg tcactgaccc agacacaaat accacaatcc tgaccaattg 960 
ctgccagcgt aatcaggtta tctattgttc tccttccact tgcctacacg agcctggttg 1020 
tgtgatctgt gcggacgagt gctgggttcc cgccaatccg tacatctcac acccttccaa 1080 
ttggactggc acggactcct tcttggctga ccacattgat tttgttatgg gcgctcttgt 114 0 
gacctgtgac gcccttgaca ttggtgagtt gtgtggtgcg tgtgtattag tcggtgactg 1200 
gcttgtcagg cactggctta ttcacataga cctcaatgaa actggtactt gttacctgga 1260 
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agtgcccact ggaatagatc ctgggttcct 
cgaggctgtc atcttcttga ccaaactggc 
gtttagcagt gtacactacc tggcggttgg 
gtggtatcag ttgctcctag cgcttatgct 
cagggtgccc actggatgct caatagctga 
ttgccactct tatttgagtg agaatgtgtc 
caggcctatc actctagagt ataacaactc 
tgcgagggga tgtatggtta aattcaaaaa 
tgtgccatcg tactgcacta tgggcactga 
cgaagcatgc ggtgtaacac catggctaac 
attggctata ttacaatacc ctgggtctaa 
aggccatttg tattttgagg gatcagatac 
ttccactctc ctaccaccgg agaggtgggc 
tggttcttgg ttacaggttc cgcaagggtt 
attgatcacc aaagacaaag cctggaaaaa 
tttgtctctt acgggagtta ccaccaaggc 
cagcaagtat cttattttag cctacctctg 
tggttaccct ttgcgtcctg tgctcccatc 
tttgtctaaa gctcaagtag ctccttttgc 
ctgcaggcta cgttatgctg cccttttagg 
aactttcttt gttgcagcag ctgctgccca 
agtggcaggg ttagttttgt gggccggccg 
aggtccttgg cctctggtag cgcttttaac 
ttttgatacc gagataattg gagggctgac 
gtctcgtttt ggcttctttg ctcacttgtt 
ttggcaacgt tgggagaatt ggttttggaa 
tgtgctggtt tgtttccccg gtgcgacata 
cgtagctctt ctatgtttaa catccagtgc 
tagggcccat agaatgttgg tgcgtctcgg 
tcttaagttt ttcctcttag tgtttggtga 
tggtgatgtc ttgcctaatg attttgcctc 
ttttgaaggc aaggcaaggg tctataggaa 
gSttgatggt ttgcccgttg ttgcgcgtct 
gccgccagat gggtgggcca ttaccgcacc 
cacgctgtca gcgatggcag tggtcatgac 
tatcttcaga ttaggatctc tggccactag 
gtatactgct caccatggca gcaaggggcg 
cccaataacc gttgacgcgg ctaatgacca 
gtcccttact cggtgctctt gcggggagac 
"^ttggttgag gtcaacaaat ccgatgaccc 
ggctgttgcc aagggttctt caggtgcccc 
gatgttcacc gctgctagaa attctggcgg 
ggtgtgtgct ggataccatc cccagtacac 
tgtgcctaac gagtattcag tgcaaatttt 
caaattacca ctttcttaca tgcaggagaa 
ggctacaaca gcatcaatgc caaagtacat 
ctattttaat ggcaaatgta ccaacacagg 
gtacctgacc ggagcatgtt cccggaacta 



agggtttatc 


gggtggatgg 


ccggcaaggt 


1320 


ttcacaagta 


ccatacgcta 


ttgcgactat 


1380 


cgctctgatc 


tactatgcct 


ctcggggcaa 


1440 


ttacatagaa 


gcgacctctg 


gaaaccccat 


1500 


gttttgctcg 


cctttgatga 


taccatgtcc 


1560 


agaagtcatt 


tgttacagtc 


caaagtggac 


1620 


catatcttgg 


tacccctata 


caatccctgg 


1680 


taacacatgg 


ggttgctgcc 


gtattcgcaa 


1740 


tgcagtgtgg 


aacgacactc 


gcaacactta 


1800 


aaccgcatgg 


cacaacggct 


cagccctgaa 


1860 


agaaatgttt 


aaacctcata 


attggatgtc 


1920 


ccctatagtt 


tacttttatg 


accctgtgaa 


1980 


taggttgccc 


ggtaccccac 


ctgtggtacg 


2040 


ttacagtgat 


gtgaaagacc 


tagccacagg 


2100 


ttatcaggtc 


ttatattccg 


ccacgggtgc 


2160 


cgtggtgcta 


attctgttgg 


ggttgtgtgg 


2220 


ttacttgtcc 


ctttgttttg 


ggcgcgcttc 


2280 


ccagtcgtat 


ctccaagctg 


gctgggatgt 


2340 


tttgattttc 


ttcatctgtt 


gctatctccg 


2400 


gtttgtgccc 


atggctgcgg 


gcttgcccct 


2460 


accagattat 


gactggtggg 


tgcgactgct 


2520 


taaccgtggt 


caccgcatag 


ctctgcttgt 


2580 


cctcttgcat 


ttggttacgc 


ctgcttcagc 


2640 


aataccacct 


gtagtagcat 


tagttgtcat 


2700 


acctcgctgt 


gctttagtta 


actcctatct 


2760 


cgttacacta 


agaccggaga 


ggtttttcct 


2820 


tgacgcgctg 


gtgactttct 


gtgtgtgtca 


2880 


agcatcgttc 


tttgggactg 


actctagggt 


2940 


aaagtgtcat 


gcttggtatt 


ctcattatgt 


3000 


gaatggtgtg 


tttttctata 


agcacttgca 


3060 


gaaactacca 


ttgcaagagc 


catttttccc 


3120 


tgaaggaaga 


cgcttggcgt 


gtggggacac 


3180 


cggcgacctt 


gttttcgcag 


ggttggctat 


3240 


ttttacgctg 


cagtgtctct 


ctgaacgtgg 


3300 


tggtatagac 


ccccgaactt 


ggactggaac 


3360 


ctacatggga 


tttgtttgtg 


acaacgtgtt 


3420 


ccggttggct 


catcccacag 


gctctataca 


3480 


ggacatctat 


caaccaccat 


gtggagctgg 


3540 


caaggggtat 


ctggtaacac 


gactggggtc 


3600 


ttattggtgt 


gtgtgcgggg 


cccttcccat 


3660 


gattctgtgc 


tcctccgggc 


atgttattgg 


3720 


ttcagtcagt 


cagattaggg 


ttaggccgtt 


3780 


agcacatgcc 


actcttgata 


caaaacctac 


3840 


aattgccccc 


actggcagcg 


gcaagtcaac 


3900 


gtatgaggtc 


ttggtcctaa 


atcccagtgt 


3960 


gcacgcgacg 


tacggcgtga 


atccaaattg 


4020 


ggcttcactt 


acgtacagca 


catatggcat 


4080 


tgatgtaatc 


atttgtgacg 


aatgccatgc 


4140 
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taccgatgca accaccgtgt tgggcattgg aaaggtccta accgaagctc catccaaaaa 4200 
tgttaggcta gtggttcttg ccacggctac cccccctgga gtaatcccta caccacatgc 4260 
caacataact gagattcaat taaccgatga aggcactatc ccctttcatg gaaaaaagat 4320 
taaggaggaa aatctgaaga aagggagaca ccttatcttt gaggctacca aaaaacactg 4380 
tgatgagctt gctaacgagt tagctcgaaa gggaataaca gctgtctctt actatagggg 444 0 
atgtgacatc tcaaaaatcc ctgagggcga ctgtgtagta gttgccactg atgccttgtg 4500 
tacagggtac actggtgact ttgattccgt gtatgactgc agcctcatgg tagaaggcac 4560 
atgccatgtt gaccttgacc ctactttcac catgggtgtt cgtgtgtgcg gggtttcagc 4620 
aatagttaaa ggccagcgta ggggccgcac aggccgtggg agagctggca tatactacta 4680 
tgtagacggg agttgtaccc cttcgggtat ggttcctgaa tgcaacattg ttgaagcctt 4740 
cgacgcagcc aaggcatggt atggtttgtc atcaacagaa gctcaaacta ttctggacac 4800 
ctatcgcacc caacctgggt tacctgcgat aggagcaaat ttggacgagt gggctgatct 4660 
cttttctatg gtcaaccccg aaccttcatt tgtcaatact gcaaaaagaa ctgctgacaa 4920 
ttatgttttg ttgactgcag cccaactaca actgtgtcat cagtatggct atgctgctcc 4980 
caatgacgca ccacggtggc agggagcccg gcttgggaaa aaaccttgtg gggttctgtg 5040 
gcgcttggac ggcgctgacg cctgtcctgg cccagagccc agcgaggtga ccagatacca 5100 
aatgtgcttc actgaagtca atacttctgg gacagccgca ctcgctgttg gcgttggagt 5160 
ggctatggct tatctagcca ttgacacttt tggcgccact tgtgtgcggc gttgctggtc 5220 
tattacatca gtccctaccg gtgctactgt cgccccagtg gttgacgaag aagaaatcgt 5280 
ggaggagtgt gcatcattca ttcccttgga ggccatggtt gctgcaattg acaagctgaa 5340 
gagtacaatc accacaacta gtcctttcac attggaaacc gcccttgaaa aacttaacac 5400 
ctttcttggg cctcatgcag ctacaatcct tgctatcata gagtattgct gtggtttagt 54 60 
cactttacct gacaatccct ttgcatcatg cgtgtttgct ttcattgcgg gtattactac 5520 
cccactacct cacaagatca aaatgttcct gtcattattt ggaggcgcaa ttgcgtccaa 5580 
gcttacagac gctagaggcg cactggcgtt catgatggcc ggggctgcgg gaacagctct 5640 
tggtacatgg acatcggtgg gttttgtctt tgacatgcta ggcggctatg ctgccgcctc 5700 
atccactgct tgcttgacat ttaaatgctt gatgggtgag tggcccacta tggatcagct 5760 
tgctggttta gtctactccg cgttcaatcc ggccgcagga gttgtgggcg tcttgtcagc 5820 
ttgtgcaatg tttgctttga caacagcagg gccagatcac tggcccaaca gacttcttac 5880 
tatgcttgct aggagcaaca ctgtatgtaa tgagtacttt attgccactc gtgacatccg 5940 
caggaagata ctgggcattc tggaggcatc taccccctgg agtgtcatat cagcttgcat 6000 
ccgttggctc cacaccccga cggaggatga ttgcggcctc attgcttggg gtctagagat 6060 
ttggcagtat gtgtgcaatt tctttgtgat ttgctttaat gtccttaaag ctggagttca 6120 
gagcatggtt aacattcctg gttgtccttt ctacagctgc cagaaggggt acaagggccc 6180 
ctggattgga tcaggtatgc tccaagcacg ctgtccatgc ggtgctgaac tcatcttttc 6240 
tgttgagaat ggttttgcaa aactttacaa aggacccaga acttgttcaa attactggag 6300 
aggggctgtt ccagtcaacg ctaggctgtg tgggtcggct agaccggacc caactgattg 6360 
gactagtctt gtcgtcaatt atggcgttag ggactactgt aaatatgaga aaatgggaga 6420 
tcacattttt gttacagcag tatcctctcc aaatgtctgt ttcacccagg tgcccccaac 6480 
cttgagagct gcagtggccg tggacggcgt acaggttcag tgttatctag gtgagcccaa 6540 
aactccttgg acgacatctg cttgctgtta cggtcctgac ggtaagggta aaactgttaa 6600 
gcttcccttc cgcgctgacg gtcacacacc tggtgtgcgc atgcaactta atttgcgtga 6660 
tgcacttgag acaaatgact gtaattccac aaacaacact cctagtgatg aagccgcagt 6720 
gtccgctctt gttttcaaac aggagttgcg gcgtacaaac caattgcttg aggcaatttc 6780 
agctggcgtt gacaccacca aactgccagc cccctccatc gaagaggtag tggtaagaaa 6840 
gcgccagttc cgggcaagaa ctggttcgct taccttgcct ccccctccga gatccgtccc 6900 
aggagtgtca tgtcctgaaa gcctgcaacg aagtgacccg ttagaaggtc cttcaaacct 6960 
ccctccttca ccacctgttc tacagttggc catgccgatg cccctgttgg gagcgggtga 7020 
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gtgtaaccct ttcactgcaa ttggatgtgc 
tttacccagt taccctccca aaaaggaggt 
ggctacaacc gcttccagct acgttactgg 
ttccactcag tcagcccccg ccaaacggcc 
ttcgtgcagc atgagctaca cctggaccga 
tctgtctgca actcgggcca tcactagtgg 
gactgagccg cgggatgcgg agcttagaaa 
gttcccccca tcataccaca agcaagtgag 
cggtgtcatg tgggactatg atgaagtagc 
ccacatcact ggccttcggg gcactgatgt 
ggacttgcag aagtgtgtcg aggcaggtga 
agttccaaag gaggaggtct tcgtgaagac 
gcttatctcg tacccccacc ttgaaatgag 
tgctcctgac gtagttaaag ctgtcatggg 
ccgtgtcaag cgtctgttgt cgatgtggtc 
agtgtgtttt gacagtacca tcacacccga 
agcagctaaa ctcagtgacc aacaccgagc 
cgctggagga ccgatgatcg cttatgatgg 
ttccggcgtc tatactacct caagttccaa 
tgcagccgaa caggctggca tgaagaaccc 
cgtaatttgg aagagcgccg gagcagatgc 
ctggatgaag gtgatgggtg caccacaaga 
agaattaaca tcatgctcat caaatgttac 
ctactttctt acaagagatc ctcgtatccc 
atacaacccc agtgctgcgt ggattgggta 
tagccgtgtg ttggctgtcc atttcatgga 
gactgtgacc tttgactggt atgggaaaaa 
catcattgct ggtgtgcacg gtattgaggc 
gatcctcaga gtttcccaat cactaacaga 
aaagaaagcc agggcggtcc tcgccagcgc 
ggctcgcttc cttctctggc atgctacatc 
cgtggctcgg tacaccactt tcaattattg 
tattacacca cagagaagat tgcagaagtt 
tgccctaggg ctcattgctg ttggattagc 
ctaacagttt tttttttttt tttttttttt 
ttaacgaccc cgccgatgtg agtttggcga 
gccatggtct gaaggggatg acgtcccttc 
gtgaggagtc ctggctgtgt gggaagcagt 
ctcacgacgt atttgtccgc tgtgcagagc 
ttccaagcgg agggcaaccc ccgcttggaa 



aatgaccgaa acaggcggag gccctgatga 7080 
ctctgaatgg tcagacgaaa gttggtcgac 7140 
ccccccgtac cctaagatac ggggaaagga 7200 
tacaaaaaag aagttgggaa agagtgagtt 7260 
cgtgattagc ttcaaaactg cttctaaagt 7320 
tttcctcaaa caaagatcat tggtgtatgt 7380 
acaaaaagtc actattaata gacaacctct 744 0 
attggctaag gaaaaagctt caaaagttgt 7500 
agctcacacg ccctctaagt ctgctaagtc 7560 
tcgttctgga gcagcccgca aggctgttct 7620 
gataccgagt cattatcggc aaactgtgat 7680 
cccccagaaa ccaacaaaga aacccccaag .7740 
atgtgttgag aagatgtact acggtcaggt ,7800 
agatgcgtac gggtttgtag atccacgtac 7860 
acccgatgca gtcggagcca catgcgatac 7920 
ggatatcatg gtggagacag acatctactc 7980 
tggcattcac accattgcga ggcagttata 8040 
ccgagagatc ggatatcgta ggtgtaggtc 8100 
cagtttgacc tgctggctga aggtaaatgc 6160 
tcgcttcctt atttgcggcg atgattgcac 8220 
agacaaacaa gcaatgcgtg tctttgctag 6280 
ttgtgtgcct caacccaaat acagtttgga 8340 
ctctggaatt accaaaagtg gcaagcctta 8400 
ccttggcagg tgctctgccg agggtctggg 8460 
tctaatacat cactacccat gtttgtgggt 8520 
gcagatgctc tttgaggaca aacttcccga 8580 
ttatacggtg cctgtagaag atctgcccag 8640 
tttctcggtg gtgcgctaca ccaacgctga 8700 
catgaccatg ccccccctgc gagcctggcg 8760 
caagaggcgt ggcggagcac acgcaaaatt 6820 
tagaccCcta ccagatttigg ataagacgag 8880 
tgatgtttac tccccggagg gggatgtgtt 894 0 
ccttgtgaag tatttggctg tcattgtttt 9000 
catcagctga acccccaaat tcaaaattaa 9060 
agggcagcgg caacagggga gaccccgggc 9120 
ccatggtgga tcagaaccgt ttcgggtgaa 9180 
tggctcatcc acaaaaaccg tctcgggtgg 9240 
cagtataatt cccgtcgtgt gtggtgacgc 9300 
gtagtaccaa gggctgcacc ccggtttttg 9360 
ttaaaaact 9399 



<210> 2 

<211> 2864 

<212> PRT 

<213> GBV-B virus 

<400> 2 
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Met Pro Val He Ser Thr Gin Thr Ser Pro Val Pro Ala Pro Arg Thr 
15 10 15 

Arg Lys Asn Lys Gin Thr Gin Ala Ser Tyr Pro Val Ser He Lys Thr 
20 25 30 

Ser Val Glu Arg Gly Gin Arg Ala Lys Arg Lys Val Gin Arg Asp Ala * 
35 40 45 

Arg Pro Arg Asn Tyr Lys He Ala Gly He His Asp Gly Leu Gin Thr 
50 55 60 

Leu Ala Gin Ala Ala Leu Pro Ala His Gly Trp Gly Arg Gin Asp Pro 
65 70 75 80 

Arg His Lys Ser Arg Asn Leu Gly He Leu Leu Asp Tyr Pro Leu Gly 
85 90 95 

Trp He Gly Asp Val Thr Thr His Thr Pro Leu Val Gly Pro Leu Val 
100 105 110 

Ala Gly Ala Val Val Arg Pro Val Cys Gin He Val Arg Leu Leu Glu 
115 120 125 

Asp Gly Val Asn Trp Ala Thr Gly Trp Phe Gly Val His Leu Phe Val 
130 135 140 

Val Cys Leu Leu Ser Leu Ala Cys Pro Cys Ser Gly Ala Arg Val Thr 
145 150 155 160 

Asp Pro Asp Thr Asn Thr Thr He Leu Thr Asn Cys Cys Gin Arg Asn 
165 170 175 

Gin Val He Tyr Cys Ser Pro Ser Thr Cys Leu His Glu Pro Gly Cys 
180 185 190 

Val He Cys Ala Asp Glu Cys Trp Val Pro Ala Asn Pro Tyr He Ser 
195 200 205 

His Pro Ser Asn Trp Thr Gly Thr Asp Ser Phe Leu Ala Asp His He 
210 215 220 

Asp Phe Val Met Gly Ala Leu Val Thr Cys Asp Ala Leu Asp He Gly 
225 230 235 240 

Glu Leu Cys Gly Ala Cys Val Leu Val Gly Asp Trp Leu Val Arg His 
245 250 255 
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Trp Leu lie His lie Asp Leu Asn Glu Thr Gly Thr Cys Tyr Leu Glu 
260 265 270 

Val Pro Thr Gly lie Asp Pro Gly Phe Leu Gly Phe lie Gly Trp Met 
275 280 285 

Ala Gly Lys Val Glu Ala val lie Phe Leu Thr Lys Leu Ala Ser Gin 
290 295 300 

Val Pro Tyr Ala lie Ala Thr Met Phe Ser Ser Val His Tyr Leu Ala 
305 310 315 320 

Val Gly Ala Leu lie Tyr Tyr Ala Ser Arg Gly Lys Trp Tyr Gin Leu 
325 330 335 

Leu Leu Ala Leu Met Leu Tyr lie Glu Ala Thr Ser Gly Asn Pro lie 
340 345 350 

Arg Val Pro Thr Gly Cys Ser lie Ala Glu Phe Cys Ser Pro Leu Met 
355 360 365 

He Pro Cys Pro Cys His Ser Tyr Leu Ser Glu Asn Val Ser Glu Val 
370 ' 375 380 

He Cys Tyr Ser Pro Lys Trp Thr Arg Pro He Thr Leu Olu Tyr Asn 
365 390 395 400 

Asn Ser He Ser Trp Tyr Pro Tyr Thr He Pro Gly Ala Arg Gly Cys 
405 410 415 

Net Val Lys Phe Lys Asn Asn Thr Trp Gly Cys Cys Arg He Arg Asn 
420 425 430 

Val Pro Ser Tyr Cys Thr Met Gly Thr Asp Ala Val Trp Asn Asp Thr 
435 440 445 

Arg Asn Thr Tyr Glu Ala Cys Gly Val Thr Pro Trp Leu Thr Thr Ala 
450 455 460 

Trp His Asn Gly Ser Ala Leu Lys Leu Ala He Leu Gin Tyr Pro Gly 
465 470 475 480 

Ser Lys Glu Met Phe Lys Pro His Asn Trp Met Ser Gly His Leu Tyr 
485 490 495 

Phe Glu Gly Ser Asp Thr Pro He Val Tyr Phe Tyr Asp Pro Val Asn 
500 505 510 
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Ser Thr Leu Leu Pro Pro Glu Arg Trp Ala Arg Leu Pro Gly Thr Pro 
515 520 525 

Pro Val Val Arg Gly Ser Trp Leu Gin Val Pro Gin Gly Phe Tyr Ser 
530 535 540 

Asp Val Lys Asp Leu Ala Thr Gly Leu lie Thr Lys Asp Lys Ala Trp 
545 550 555 560 

Lys Asn Tyr Gin Val Leu Tyr Ser Ala Thr Gly Ala Leu Ser Leu Thr 
565 570 575 

Gly Val Thr Thr Lys Ala Val Val Leu He Leu Leu Gly Leu Cys Gly 
580 585 590 

Ser Lys Tyr Leu He Leu Ala Tyr Leu Cys Tyr Leu Ser Leu Cys Phe 
595 600 605 

Gly Arg Ala Ser Gly Tyr Pro Leu Arg Pro Val Leu Pro Ser Gin Ser 
610 615 620 

Tyr Leu Gin Ala Gly Trp Asp Val Leu Ser Lys Ala Gin Val Ala Pro 
625 630 635 640 

Phe Ala Leu He Phe Phe He Cys Cys Tyr Leu Arg Cys Arg Leu Arg 
645 650 655 

Tyr Ala Ala Leu Leu Gly Phe Val Pro Met Ala Ala Gly Leu Pro Leu 
660 665 670 

Thr Phe Phe Val Ala Ala Ala Ala Ala Gin Pro Asp Tyr Asp Trp Trp 
675 660 685 

Val Arg Leu Leu Val Ala Gly Leu Val Leu Trp Ala Gly Arg Asn Arg 
690 695 700 

Gly His Arg He Ala Leu Leu Val Gly Pro Trp Pro Leu Val Ala Leu 
705 710 715 720 

Leu Thr Leu Leu His Leu Val Thr Pro Ala Ser Ala Phe Asp Thr Glu 
725 730 735 

He He Gly Gly Leu Thr He Pro Pro Val Val Ala Leu Val Val Met 
740 745 750 



Ser Arg Phe Gly Phe Phe Ala His Leu Leu Pro Arg Cys Ala Leu Val 
755 760 765 
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Asn Ser Tyr Leu Trp Gin Axg Trp Glu Asn Trp Phe Trp Asn Val Thr 
770 775 780 

Leu Arg Pro Glu Arg Phe Phe Leu Val Leu Val Cys Phe Pro Gly Ala 
785 790 795 800 

Thr Tyr Asp Ala Leu Val Thr Phe Cys Val Cys His Val Ala Leu Leu 
805 810 815 

Cys Leu Thr Ser Ser Ala Ala Ser Phe Phe Gly Thr Asp Ser Arg Val 
820 825 830 

Arg Ala His Arg Met Leu Val Arg Leu Gly Lys Cys His Ala Trp Tyr 
835 840 845 

Ser His Tyr Val Leu Lys Phe Phe Leu Leu Val Phe Gly Glu Asn Gly 
850 855 660 

Val Phe Phe Tyr Lys His Leu His Gly Asp Val Leu Pro Asn Asp Phe 
865 870 875 880 

Ala Ser Lys Leu Pro Leu Gin Glu Pro Phe Phe Pro Phe Glu Gly Lys 
885 890 895 

Ala Arg Val Tyr Arg Asn Glu Gly Arg Arg Leu Ala Cys Gly Asp Thr 
900 905 910 

Val Asp Gly Leu Pro Val Val Ala Arg Leu Gly Asp Leu Val Phe Ala 
915 920 925 

Gly Leu Ala Met Pro Pro Asp Gly Trp Ala lie Thr Ala Pro Phe Thr 
930 935 940 

Leu Gin Cys Leu Ser Glu Arg Gly Thr Leu Ser Ala Met Ala Val Val 
945 950 955 960 

Met Thr Gly lie Asp Pro Arg Thr Trp Thr Gly Thr lie Phe Arg Leu 
965 970 975 

Gly Ser Leu Ala Thr Ser Tyr Met Gly Phe Val Cys Asp Asn Val Leu 
980 985 990 

Tyr Thr Ala His His Gly Ser Lys Gly Arg Arg Leu Ala His Pro Thr 
995 1000 1005 

Gly Ser lie His Pro He Thr Val Asp Ala Ala Asn Asp Gin Asp He 
1010 1015 1020 
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Tyr Gin Pro Pro Cys Gly Ala Gly Ser Leu Thr Arg Cys Ser Cys Gly 
1025 1030 1035 1040 

Glu Thr hys Gly Tyr Leu Val Thr Arg Leu Gly Ser Leu Val Glu Val 
1045 1050 1055 

Asn Lys Ser Asp Asp Pro Tyr Trp Cys Val Cys Gly Ala Leu Pro Met 
1060 1065 1070 

Ala Val Ala Lys Gly Ser Ser Gly Ala Pro lie Leu Cys Ser Ser Gly 
1075 1080 1085 

His Val He Gly Met Phe Thr Ala Ala Arg Asn Ser Gly Gly Ser Val 
1090 1095 1100 

Ser Gin He Arg Val Arg Pro Leu Val Cys Ala Gly Tyr His Pro Gin 
1105 1110 1115 1120 

Tyr Thr Ala His Ala Thr Leu Asp Thr Lys Pro Thr Val Pro Asn Glu 
1125 1130 1135 

Tyr Ser Val Gin He Leu He Ala Pro Thr Gly Ser Gly Lys Ser Thr 
1140 1145 1150 

Lys Leu Pro Leu Ser Tyr Met Gin Glu Lys Tyr Glu Val Leu Val Leu 
1155 1160 1165 

Asn Pro Ser Val Ala Thr Thr Ala Ser Met Pro Lys Tyr Met His Ala 
1170 1175 1180 

Thr Tyr Gly Val Asn Pro Asn Cys Tyr Phe Asn Gly Lys Cys Thr Asn 
11B5 1190 1195 1200 

Thr Gly Ala Ser Leu Thr Tyr Ser Thr Tyr Gly Met Tyr Leu Thr Gly 
1205 1210 1215 

Ala Cys Ser Arg Asn Tyr Asp Val He He Cys Asp Glu Cys His Ala 
1220 1225 1230 

Thr Asp Ala Thr Thr Val Leu Gly He Gly Lys Val Leu Thr Glu Ala 
1235 1240 1245 

Pro Ser Lys Asn Val Arg Leu Val Val Leu Ala Thr Ala Thr Pro Pro 
1250 1255 1260 

Gly Val He Pro Thr Pro His Ala Asn He Thr Glu He Gin Leu Thr 
1265 1270 1275 1280 



9 



BNSDOCID;<WO 007S337AIJ > 



wo 00/75337 PCT/USOO/15293 

Asp Glu Gly Thr lie Pro Phe His Gly Lys Lys lie Lys Glu Glu Asn 
1285 1290 1295 

Leu Lys Lys Gly Arg His Leu lie Phe Glu Ala Thr Lys Lys His Cys 
1300 1305 1310 

Asp Glu Leu Ala Asn Glu Leu Ala Arg Lys Gly lie Thr Ala Val Ser 
1315 1320 1325 

Tyr Tyr Arg Gly Cys Asp lie Ser Lys lie Pro Glu Gly Asp Cys Val 
1330 1335 1340 

Val Val Ala Thr Asp Ala Leu Cys Thr Gly Tyr Thr Gly Asp Phe Asp 
1345 1350 1355 1360 

Ser Val Tyr Asp Cys Ser Leu Met Val Glu Gly Thr Cys His Val Asp . 

1365 1370 1375 

Leu Asp Pro Thr Phe Thr Met Gly Val Arg Val Cys Gly Val Ser Ala 
1380 1385 1390 

He Val Lys Gly Gin Arg Arg Gly Arg Thr Gly Arg Gly Arg Ala Gly 
1395 1400 1405 

lie Tyr Tyr Tyr Val Asp Gly Ser Cys Thr Pro Ser Gly Met Val Pro 
1410 1415 1420 

Glu Cys Asn lie Val Glu Ala Phe Asp Ala Ala Lys Ala Trp Tyr Gly 
1425 1430 1435 1440 

Leu Ser Ser Thr Glu Ala Gin Thr He Leu Asp Thr Tyr Arg Thr Gin 
1445 1450 1455 

Pro Gly Leu Pro Ala He Gly Ala Asn Leu Asp Glu Trp Ala Asp Leu 
1460 1465 1470 

Phe Ser Met Val Asn Pro Glu Pro Ser Phe Val Asn Thr Ala Lys Arg 
1475 1480 1485 

Thr Ala Asp Asn Tyr Val Leu Leu Thr Ala Ala Gin Leu Gin Leu Cys 
1490 1495 1500 

His Gin Tyr Gly Tyr Ala Ala Pro Asn Asp Ala Pro Arg Trp Gin Gly 
1505 1510 1515 1520 

Ala Arg Leu Gly Lys Lys Pro Cys Gly Val Leu Trp Arg Leu Asp Gly 
1525 1530 1535 
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Ala Asp Ala Cys Pro Gly Pro Glu Pro Ser Glu Val Thr Arg Tyr Gin 
1540 1545 1550 

Met Cys Phe Thr Glu Val Asn Thr Ser Gly Thr Ala Ala Leu Ala Val 
1555 1560 1565 

Gly Val Gly Val Ala Met Ala Tyr Leu Ala lie Asp Thr Phe Gly Ala 
1570 1575 1580 

Thr Cys Val Arg Arg Cys Trp Ser lie Thr Ser Val Pro Thr Gly Ala 
1585 1590 1595 1600 

Thr Val Ala Pro Val Val Asp Glu Glu Glu He Val Glu Glu Cys Ala 
1605 1610 1615 

Ser Phe lie Pro Leu Glu Ala Met Val Ala Ala lie Asp Lys Leu Lys - 
1620 1625 1630 

Ser Thr He Thr Thr Thr Ser Pro Phe Thr Leu Glu Thr Ala Leu Glu 
1635 1640 1645 

Lys Leu Asn Thr Phe Leu Gly Pro His Ala Ala Thr He Leu Ala He 
1650 1655 1660 

He Glu Tyr Cys Cys Gly Leu Val Thr Leu Pro Asp Asn Pro Phe Ala 
1665 1670 1675 1680 . 

Ser Cys Val Phe Ala Phe He Ala Gly He Thr Thr Pro Leu Pro His 
1685 1690 1695 

Lys He Lys Met Phe Leu Ser Leu Phe Gly Gly Ala He Ala Ser Lys 
1700 1705 1710 

Leu Thr Asp Ala Arg Gly Ala Leu Ala Phe Met Met Ala Gly Ala Ala 
1715 1720 1725 

Gly Thr Ala Leu Gly Thr Trp Thr Ser Val Gly Phe Val Phe Asp Met 
1730 1735 1740 

Leu Gly Gly Tyr Ala Ala Ala Ser Ser Thr Ala Cys Leu Thr Phe Lys 
1745 1750 1755 1760 

Cys Leu Met Gly Glu Trp Pro Thr Met Asp Gin Leu Ala Gly Leu Val 
1765 1770 1775 

Tyr Ser Ala Phe Asn Pro Ala Ala Gly Val Val Gly Val Leu Ser Ala 
1780 1785 1790 
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Cys Ala Met Phe Ala Leu Thr Thr Ala Gly Pro Asp His Trp Pro Asn 
1795 1800 1805 

Arg Leu Leu Thr Met Leu Ala Arg Ser Asn Thr Val Cys Asn Glu Tyr 
1810 1815 1820 

Phe lie Ala Thr Arg Asp lie Arg Arg Lys lie Leu Gly lie Leu Glu 
1825 1830 1835 1840 

Ala Ser Thr Pro Trp Ser Val lie Ser Ala Cys lie Arg Trp Leu His 
1845 1650 1855 

Thr Pro Thr Glu Asp Asp Cys Gly Leu lie Ala Trp Gly Leu Glu lie 
1860 1865 1870 

Trp Gin Tyr Val Cys Asn Phe Phe Val lie Cys Phe Asn Val Leu Lys 
1875 1880 1885 

Ala Gly Val Gin Ser Met Val Asn lie Pro Gly Cys Pro Phe Tyr Ser 
1690 1895 1900 

Cys Gin Lys Gly Tyr Lys Gly Pro Trp lie Gly Ser Gly Met Leu Gin 
1905 1910 1915 1920 

Ala Arg Cys Pro Cys Gly Ala Glu Leu lie Phe Ser Val Glu Asn Gly 
1925 1930 1935 

Phe Ala Lys Leu Tyr Lys Gly Pro Arg Thr Cys Ser Asn Tyr Trp Arg 
1940 1945 1950 

Gly Ala Val Pro Val Asn Ala Arg Leu Cys Gly Ser Ala Arg Pro Asp 
1955 1960 1965 

Pro Thr Asp Trp Thr Ser Leu Val Val Asn Tyr Gly Val Arg Asp Tyr 
1970 1975 1980 

Cys Lys Tyr Glu Lys Met Gly Asp His He Phe Val Thr Ala Val Ser 
1985 1990 1995 2000 

Ser Pro Asn Val Cys Phe Thr Gin Val Pro Pro Thr Leu Arg Ala Ala 
2005 2010 2015 

Val Ala Val Asp Gly Val Gin Val Gin Cys Tyr Leu Gly Glu Pro Lys 
2020 2025 2030 

Thr Pro Trp Thr Thr Ser Ala Cys Cys Tyr Gly Pro Asp Gly Lys Gly 
2035 2040 2045 
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Lys Thr Val Lys Leu Pro Phe Arg Val Asp Gly His Thr Pro Gly Val 
2050 2055 2060 

Arg Met Gin Leu Asn Leu Arg Asp Ala Leu Glu Thr Asn Asp Cys Asn 
2065 2070 2075 2080 

Ser Thr Asn Asn Thr Pro Ser Asp Glu Ala Ala Val Ser Ala Leu Val 
2085 2090 2095 

Phe Lys Gin Glu Leu Arg Arg Thr Asn Gin Leu Leu Glu Ala lie Ser 
2100 2105 2110 

Ala Gly Val Asp Thr Thr Lys Leu Pro Ala Pro Ser lie Glu Glu Val 
2115 2120 2125 

Val Val Arg Lys Arg Gin Phe Arg Ala Arg Thr Gly Ser Leu Thr Leu 
2130 2135 2140 

Pro Pro Pro Pro Arg Ser Val Pro Gly Val Ser Cys Pro Glu Ser Leu 
2145 2150 2155 2160 

Gin Arg Ser Asp Pro Leu Glu Gly Pro Ser Asn liCu Pro Pro Ser Pro 
2165 2170 2175 

Pro Val Leu Gin Leu Ala Met Pro Met Pro Leu Leu Gly Ala Gly Glu 
2180 2185 2190 

Cys Asn Pro Phe Thr Ala lie Gly Cys Ala Met Thr Glu Thr Gly Gly 
2195 2200 2205 

Gly Pro Asp Asp Leu Pro Ser Tyr Pro Pro Lys Lys Glu val Ser Glu 
2210 2215 2220 

Trp Ser Asp Glu Ser Trp Ser Thr Ala Thr Thr Ala Ser Ser Tyr Val 
2225 2230 2235 2240 

Thr Gly Pro Pro Tyr Pro Lys He Arg Gly Lys Asp Ser Thr Gin Ser 
2245 2250 2255 

Ala Pro Ala Lys Arg Pro Thr Lys Lys Lys Leu Gly Lys Ser Glu Phe 
2260 2265 2270 

Ser Cys Ser Met Ser Tyr Thr Trp Thr Asp Val He Ser Phe Lys Thr 
2275 2280 2285 

Ala Ser Lys Val Leu Ser Ala Thr Arg Ala He Thr Ser Gly Phe Leu 
2290 2295 2300 
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Lys Gin Arg Ser Leu Val Tyr Val Thr Glu Pro Arg Asp Ala Glu Leu 
2305 2310 2315 2320 

Arg Lys Gin Lys Val Thr lie Asn Arg Gin Pro Leu Phe Pro Pro Ser 
2325 2330 2335 

Tyr His Lys Gin Val Arg Leu Ala Lys Glu Lys Ala Ser Lys Val Val 
2340 2345 2350 

Gly Val Met Trp Asp Tyr Asp Glu Val Ala Ala His Thr Pro Ser Lys 
2355 2360 2365 

Ser Ala Lys Ser His He Thr Gly Leu Arg Gly Thr Asp Val Arg Ser 
2370 2375 2380 

Gly Ala Ala Arg Lys Ala Val Leu Asp Leu Gin Lys Cys Val Glu Ala 
2385 2390 2395 2400 

Gly Glu He Pro Ser His Tyr Arg Gin Thr Val He Val Pro Lys Glu 
2405 2410 2415 

Glu Val Phe Val Lys Thr Pro Gin Lys Pro Thr Lys Lys Pro Pro Arg 
2420 2425 2430 

Leu He Ser Tyr Pro His Leu Glu Met Arg Cys Val Glu Lys Met Tyr 
2435 2440 2445 

Tyr Gly Gin Val Ala Pro Asp Val Val Lys Ala Val Met Gly Asp Ala 
2450 2455 2460 

Tyr Gly Phe Val Asp Pro Arg Thr Arg Val Lys Arg Leu Leu Ser Met 
2465 2470 2475 2480 

Trp Ser Pro Asp Ala Val Gly Ala Thr Cys Asp Thr Val Cys Phe Asp 
2485 2490 2495 

Ser Thr He Thr Pro Glu Asp He Met Val Glu Thr Asp He Tyr Ser 
2500 2505 2510 

Ala Ala Lys Leu Ser Asp Gin His Arg Ala Gly He His Thr He Ala 
2515 2520 2525 

Arg Gin Leu Tyr Ala Gly Gly Pro Met He Ala Tyr Asp Gly Arg Glu 
2530 2535 2540 

He Gly Tyr Arg Arg Cys Arg Ser Ser Gly Val Tyr Thr Thr Ser Ser 
2545 2550 2555 2560 
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Ser Asn Ser Leu Thr Cys Trp Leu Lys Val Asn Ala Ala Ala Glu Gin 
2565 2570 2575 

Ala Gly Met Lys Asn Pro Arg Phe Leu He Cys Gly Asp Asp Cys Thr 
2580 2585 2590 

Val He Trp Lys Ser Ala Gly Ala Asp Ala Asp Lys Gin Ala Met Arg 
2595 2600 2605 

Val Phe Ala Ser Trp Met Lys Val Met Gly Ala Pro Gin Asp Cys Val 
2610 2615 2620 

Pro Gin Pro Lys Tyr Ser Leu Glu Glu Leu Thr Ser Cys Ser Ser Asn 
2625 2630 2635 2640 

Val Thr Ser Gly He Thr Lys Ser Gly Lys Pro Tyr Tyr Phe Leu Thr 
2645 2650 2655 

Arg Asp Pro Arg He Pro Leu Gly Arg Cys Ser Ala Glu Gly Leu Gly 
2660 2665 2670 

Tyr Asn Pro Ser Ala Ala Trp He Gly Tyr Leu He His His Tyr Pro 
2675 2680 2685 

Cys Leu Trp Val Ser Arg Val Leu Ala Val His Phe Met Glu Gin Met 
2690 2695 2700 

Leu Phe Glu Asp Lys Leu Pro Glu Thr Val Thr Phe Asp Trp Tyr Gly 
2705 2710 2715 2720 

Lys Asn Tyr Thr Val Pro Val Glu Asp Leu Pro Ser He He Ala Gly 
2725 2730 2735 

Val His Gly He Glu Ala Phe Ser Val Val Arg Tyr Thr Asn Ala Glu 
2740 2745 2750 

He Leu Arg Val Ser Gin Ser Leu Thr Asp Met Thr Met Pro Pro Leu 
2755 2760 2765 

Arg Ala Trp Arg Lys Lys Ala Arg Ala Val Leu Ala Ser Ala Lys Arg 
2770 2775 2780 

Arg Gly Gly Ala His Ala Lys Leu Ala Arg Phe Leu Leu Trp His Ala 
2785 2790 2795 2800 

Thr Ser Arg Pro Leu Pro Asp Leu Asp Lys Thr Ser Val Ala Arg Tyr 
2805 2810 2815 
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Thr Thr Phe Asn Tyr Cys Asp Val Tyr Ser Pro Glu Gly Asp Val Phe 
2820 2B25 2830 

lie Thr Pro Gin Arg Arg Leu Gin Lys Phe Leu Val Lys Tyr Leu Ala 
2835 2840 2845 

Val He Val Phe Ala Leu Gly Leu He Ala Val Gly Leu Ala He Ser 
2850 2855 2860 



<210> 3 
<211> 9139 
<212> DNA 
<213> GBV-B virus 

<400> 3 

accacaaaca ctccagtttg ttacactccg 
cagggcgtgg gggatttccc ctgcccgtct 
gtaggcggcg ggactcatga cgctcgcgtg 
cctgatgggc gttcatgggt tcggtggtgg 
cctcccagat agagcggcgg cactgtaggg 
cagacctctt tttgagtatc acgcctccgg 
tgggatggtt ggggttagcc atccataccg 
ctgggagtct cgtagaccgt agcacatgcc 
tgcgcccaga acgcgcaaga acaagcagac 
atctgttgaa aggggacaac gagcaaagcg 
ttacaaaatt gctggtatcc atgatggctt 
tcatggttgg ggacgccaag accctcgcca 
ttaccctttg gggtggattg gtgatgttac 
ggcaggagcg gtcgttcgac cagtctgcca 
ctgggctact ggttggttcg gtgtccacct 
tccctgtagt ggggcgcggg tcactgaccc 
ctgccagcgt aatcaggtta tctattgttc 
tgtgatctgt gcggacgagt gctgggttcc 
ttggactggc acggactcct tcttggctga 
gacctgtgac gcccttgaca ttggtgagtt 
gcttgtcagg cactggctta ttcacataga 
agtgcccact ggaatagatc ctgggttcct 
cgaggctgtc atcttcttga ccaaactggc 
gtttagcagt gtacactacc tggcggttgg 
gtggtatcag ttgctcctag cgcttatgct 
cagggtgccc actggatgct caaCagctga 
ttgccactct tatttgagtg agaatgtgtc 
caggcctatc actctagagt ataacaactc 
tgcgagggga tgtatggtta aattcaaaaa 
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ctaggaatgc 


tcctggagca 


ccccccctag 


60 


gcagaagggt 


ggagccaacc 


accttagtat 


120 


atgacaagcg 


ccaagcttga 


cttggatggc 


180 


tggcgcttta 


ggcagcctcc 


acgcccacca 


240 


aagaccgggg 


accggtcact 


accaaggacg 


300 


aagtagttgg 


gcaagcccac 


ctatatgtgt 


360 


tactgcctga 


tagggtcctt 


gcgaggggat 


420 


tgttatttct 


actcaaacaa 


gtcctgtacc 


480 


gcaggcttca 


tatcctgtgt 


ccattaaaac 


540 


caaagtccag 


cgcgatgctc 


ggcctcgtaa 


600 


gcagacattg 


gctcaggctg 


ctttgccagc 


660 


taagtctcgc 


aatcttggaa 


tccttctgga 


720 


aactcacaca 


cctctagtag 


gcccgctggt 


780 


gatagtacgc 


ttgctggagg 


atggagtcaa 


840 


ttttgtggta 


tgtctgctat 


ctttggcctg 


900 


agacacaaat 


accacaatcc 


tgaccaattg 


960 


tccttccact 


tgcctacacg 


agcctggttg 


1020 


cgccaatccg 


tacatctcac 


acccttccaa 


1080 


ccacattgat 


tttgttatgg 


gcgctcttgt 


1140 


gtgtggtgcg 


tgtgtattag 


tcggtgactg 


1200 


cctcaatgaa 


actggtactt 


gttacctgga 


1260 


agggtttatc 


gggtggatgg 


ccggcaaggt 


1320 


ttcacaagta 


ccatacgcta 


ttgcgactat 


1380 


cgctctgatc 


tactatgcct 


ctcggggcaa 


1440 


ttacatagaa 


gcgacctctg 


gaaaccccat 


1500 


gttttgctcg 


cctttgatga 


taccatgtcc 


1560 


agaagtcatt 


tgttacagtc 


caaagtggac 


1620 


catatcttgg 


tacccctata 


caatccctgg 


1680 


taacacatgg 


ggttgctgcc 


gtattcgcaa 
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tgtgccatcg tactgcacta tgggcactga tgcagtgtgg aacgacactc gcaacactta 1800 
cgaagcatgc ggtgtaacac catggctaac aaccgcatgg cacaacggct cagccctgaa 1860 
attggctata ttacaatacc ctgggtctaa agaaatgttt aaacctcata attggatgtc 1920 
aggccatttg tattttgagg gatcagatac ccctatagtt tacttttatg accctgtgaa 1980 
ttccactctc ctaccaccgg agaggtgggc taggttgccc ggtaccccac ctgtggtacg 2040 
tggttcttgg ttacaggttc cgcaagggtt ttacagtgat gtgaaagacc tagccacagg 2100 
attgatcacc aaagacaaag cctggaaaaa ttatcaggtc ttatattccg ccacgggtgc 2160 
tttgtctctt acgggagtta ccaccaaggc cgtggtgcta attctgttgg ggttgtgtgg 2220 
cagcaagtat cttattttag cctacctctg ttacttgtcc ctttgttttg ggcgcgcttc 2280 
tggttaccct ttgcgtcctg tgctcccatc ccagtcgtat ctccaagctg gctgggatgt 2340 
tttgtctaaa gctcaagtag ctccttttgc tttgattttc ttcatctgtt gctatctccg 2400 
ctgcaggcta cgttatgctg cccttttagg gtttgtgccc atggctgcgg gcttgcccct 2460 
aactttcttt gttgcagcag ctgctgccca accagattat gactggtggg tgcgactgct 2520 
agtggcaggg ttagttttgt gggccggccg taaccgtggt caccgcatag ctctgcttgt 2580 
aggtccttgg cctctggtag cgcttttaac cctcttgcat ttggttacgc ctgcttcagc 2640 
ttttgatacc gagataattg gagggctgac aataccacct gtagtagcat tagttgtcat 2700 
gtctcgtttt ggcttctttg ctcacttgtt acctcgctgt gctttagtta actcctatct 2760 
ttggcaacgt tgggagaatt ggttttggaa cgttacacta agaccggaga ggtttttcct 2820 
tgtgctggtt tgtttccccg gtgcgacata tgacgcgctg gtgactttct gtgtgtgtca 2880 
cgtagctctt ctatgtttaa catccagtgc agcatcgttc tttgggactg actctagggt 2940 
tagggcccat agaatgttgg tgcgtctcgg aaagtgtcat gcttggtatt ctcattatgt 3000 
tcttaagttt ttcctcttag tgtttggtga gaatggtgtg tttttctata agcacttgca 3060 
tggtgatgtc ttgcctaatg attttgcctc gaaactacca ttgcaagagc catttttccc 3120 
ttttgaaggc aaggcaaggg tctataggaa tgaaggaaga cgcttggcgt gtggggacac 3180 
ggttgatggt ttgcccgttg ttgcgcgtct cggcgacctt gttttcgcag ggttggctat 3240 
gccgccagat gggtgggcca ttaccgcacc ttttacgctg cagtgtctct ctgaacgtgg 3300 
cacgctgtca gcgatggcag tggtcatgac tggtatagac ccccgaactt ggactggaac 3360 
tatcttcaga ttaggatctc tggccactag ctacatggga tttgtttgtg acaacgtgtt 3420 
gtatactgct caccatggca gcaaggggcg ccggttggct catcccacag gctctataca 3480 
cccaataacc gttgacgcgg ctaatgacca ggacatctat caaccaccat gtggagctgg 3540 
gtcccttact cggtgctctt gcggggagac caaggggtat ctggtaacac gactggggtc 3600 
attggttgag gtcaacaaat ccgatgaccc ttattggtgt gtgtgcgggg cccttcccat 3660 
ggctgttgcc aagggttctt caggtgcccc gattctgtgc tcctccgggc atgttattgg 3720 
gatgttcacc gctgctagaa attctggcgg ttcagtcagt cagattaggg ttaggccgtt 3780 
ggtgtgtgct ggataccatc cccagtacac agcacatgcc actcttgata caaaacctac 3640 
tgtgcctaac gagtattcag tgcaaatttt aattgccccc actggcagcg gcaagtcaac 3900 
caaattacca ctttcttaca tgcaggagaa gtatgaggtc ttggtcctaa atcccagtgt 3960 
ggctacaaca gcatcaatgc caaagtacat gcacgcgacg tacggcgtga atccaaattg 4020 
ctattttaat ggcaaatgta ccaacacagg ggcttcactt acgtacagca catatggcat 4080 
gtacctgacc ggagcatgtt cccggaacta tgatgtaatc atttgtgacg aatgccatgc 4140 
taccgatgca accaccgtgt tgggcattgg aaaggtccta accgaagctc catccaaaaa 4200 
tgttaggcta gtggttcttg ccacggctac cccccctgga gtaatcccta caccacatgc 4260 
caacataact gagattcaat taaccgatga aggcactatc ccctttcatg gaaaaaagat 4320 
taaggaggaa aatctgaaga aagggagaca ccttatcttt gaggctacca aaaaacactg 4380 
tgatgagctt gctaacgagt tagctcgaaa gggaataaca gctgtctctt actatagggg 4440 
atgtgacatc tcaaaaatcc ctgagggcga ctgtgtagta gttgccactg atgccttgtg 4500 
tacagggtac actggtgact ttgattccgt gtatgactgc agcctcatgg tagaaggcac 4560 
atgccatgtt gaccttgacc ctactttcac catgggtgtt cgtgtgtgcg gggtttcagc 4620 
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aatagttaaa ggccagcgta ggggccgcac 
tgtagacggg agttgtaccc cttcgggtat 
cgacgcagcc aaggcatggt atggtttgtc 
ctatcgcacc caacctgggt tacctgcgat 
cttttctatg gtcaaccccg aaccttcatt 
ttatgttttg ttgactgcag cccaactaca 
caatgacgca ccacggtggc agggagcccg 
gcgcttggac ggcgctgacg cctgtcctgg 
aatgtgcttc actgaagtca atacttctgg 
ggctatggct tatctagcca ttgacacttt 
tattacatca gtccctaccg gtgctactgt 
ggaggagtgt gcatcattca ttcccttgga 
gagcacaatc accacaacta gtcctttcac 
ctttcttggg cctcatgcag ctacaatcct 
cactttacct gacaatccct ttgcatcatg 
cccactacct cacaagatca aaatgttcct 
gcttacagac gctagaggcg cactggcgtt 
tggtacatgg acatcggtgg gttttgtctt 
atccactgct tgcttgacat ttaaatgctt 
tgctggttta gtctactccg cgttcaatcc 
ttgtgcaatg tttgctttga caacagcagg 
tatgcttgct aggagcaaca ctgtatgtaa 
caggaagata ctgggcattc tggaggcatc 
ccgttggctc cacaccccga cggaggatga 
ttggcagtat gtgtgcaatt tctttgtgat 
gagcatggtt aacattcctg gttgtccttt 
ctggattgga tcaggtatgc tccaagcacg 
tgttgagaat ggttttgcaa aactttacaa 
aggggctgtt ccagtcaacg ctaggctgtg 
gactagtctt gtcgtcaatt atggcgttag 
tcacattttt gttacagcag tatcctctcc 
cttgagagct gcagtggccg tggacggcgt 
aactccttgg acgacatctg cttgctgtta 
gcttcccttc cgcgttgacg gtcacacacc 
tgcacttgag acaaatgact gtaattccac 
gtccgctctt gttttcaaac aggagttgcg 
agctggcgtt gacaccacca aactgccagc 
gcgccagttc cgggcaagaa ctggttcgct 
aggagtgtca tgtcctgaaa gcctgcaacg 
ccctccttca ccacctgttc tacagttggc 
gtgtaaccct ttcactgcaa ttggatgtgc 
tttacccagt taccctccca aaaaggaggt 
ggctacaacc gcttccagct acgttactgg 
ttccactcag tcagcccccg ccaaacggcc 
ttcgtgcagc atgagctaca cctggaccga 
tctgtctgca actcgggcca tcactagtgg 
gactgagccg cgggatgcgg agcttagaaa 
gttcccccca tcataccaca agcaagtgag 
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aggccgtggg agagctggca tatactacta 4680 
ggttcctgaa tgcaacattg ttgaagcctt 4 740 
atcaacagaa gctcaaacta ttctggacac 4 800 
aggagcaaat ttggacgagt gggctgatct 4860 
tgtcaatact gcaaaaagaa ctgctgacaa 4920 
actgtgtcat cagtatggct atgctgctcc 4980 
gcttgggaaa aaaccttgtg gggttctgtg 5040 
cccagagccc agcgaggtga ccagatacca 5100 
gacagccgca ctcgctgttg gcgttggagt 5160 
tggcgccact tgtgtgcggc gttgctggtc 5220 
cgccccagtg gttgacgaag aagaaatcgt 5280 
ggccatggtt gctgcaattg acaagctgaa 5340 
atcggaaacc gcccttgaaa aacttaacac 5400 
tgctatcata gagtattgct gtggtttagt 5460 
cgtgtttgct ttcattgcgg gtattactac 5520 
gtcattattt ggaggcgcaa ttgcgtccaa 5580 
catgatggcc ggggctgcgg gaacagctct 5640 
tgacatgcta ggcggctatg ctgccgcctc 5700 
gatgggtgag tggcccacta tggatcagct 5760 
ggccgcagga gttgtgggcg tcttgtcagc 5820 
gccagatcac tggcccaaca gacttcttac 5880 
tgagtacttt attgccactc gtgacatccg 5940 
taccccctgg agtgtcatat cagcttgcat 6000 
ttgcggcctc attgcttggg gtctagagat 6060 
ttgctttaat gtccttaaag ctggagttca 6120 
ctacagctgc cagaaggggt acaagggccc 6180 
ctgtccatgc ggtgctgaac tcatcttttc 6240 
aggacccaga acttgttcaa attactggag 6300 
tgggtcggct agaccggacc caactgattg 6360 
ggactactgt aaatatgaga aaatgggaga 6420 
aaatgtctgt ttcacccagg tgcccccaac 6480 
acaggttcag tgttatctag gtgagcccaa 6540 
cggtcctgac ggtaagggta aaactgttaa 6600 
tggtgtgcgc atgcaactta atttgcgtga 6660 
aaacaacact cctagtgatg aagccgcagt 6720 
gcgtacaaac caattgcttg aggcaatttc 6780 
cccctccatc gaagaggtag tggtaagaaa 6840 
taccttgcct ccccctccga gatccgtccc 6900 
aagtgacccg ttagaaggtc cttcaaacct 6960 
catgccgatg cccctgttgg gagcgggtga 7020 
aatgaccgaa acaggcggag gccctgatga 7080 
ctctgaatgg tcagacgaaa gttggtcgac 7140 
ccccccgtac cctaagatac ggggaaagga 7200 
tacaaaaaag aagttgggaa agagtgagtt 7260 
cgtgattagc ttcaaaactg cttctaaagt 7320 
tttcctcaaa caaagatcat tggtgtatgt 7380 
acaaaaagtc actattaata gacaacctct 744 0 
attggctaag gaaaaagctt caaaagttgt 7500 
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cggtgtcatg tgggactatg atgaagtagc 
ccacatcact ggccttcggg gcactgatgt 
ggacttgcag aagtgtgtcg aggcaggtga 
agttccaaag gaggaggtct tcgtgaagac 
gcttatctcg tacccccacc ttgaaatgag 
tgctcctgac gtagttaaag ctgtcaCggg 
ccgtgtcaag cgtctgttgt cgatgtggtc 
agtgtgtttt gacagtacca tcacacccga 
agcagctaaa ctcagtgacc aacaccgagc 
cgctggagga ccgatgatcg cttatgatgg 
ttccggcgtc tatactacct caagttccaa 
tgcagccgaa caggctggca tgaagaaccc 
cgtaatttgg aagagcgccg gagcagatgc 
ctggatgaag gtgatgggtg caccacaaga 
agaattaaca tcatgctcat caaatgttac 
ctactttctt acaagagatc ctcgtatccc 
atacaacccc agtgctgcgt ggattgggta 
tagccgtgtg ttggctgtcc atttcatgga 
gactgtgacc tttgactggt atgggaaaaa 
catcattgct ggtgtgcacg gtattgaggc 
gatcctcaga gtttcccaat cactaacaga 
aaagaaagcc agggcggtcc tcgccagcgc 
ggctcgcttc cttctctggc atgctacatc 
cgtggctcgg tacaccactt tcaattattg 
tattacacca cagagaagat tgcagaagtt 
tgccctaggg ctcattgctg ttggattagc 
ctaacagttt tttttttttt tttttttttt 
ttaacgaccc cgcgatgtg 



agctcacacg ccctctaagt ctgctaagtc 7560 
tcgttctgga gcagcccgca aggctgttct 7620 
gataccgagt cattatcggc aaactgtgat 7680 
cccccagaaa ccaacaaaga aacccccaag 7740 
atgtgttgag aagatgtact acggtcaggt 7800 
agatgcgtac gggtttgtag atccacgtac 7860 
acccgatgca gtcggagcca catgcgatac 7920 
ggatatcatg gtggagacag acatctactc 7980 
tggcattcac accattgcga ggcagttata 8040 
ccgagagatc ggatatcgta ggtgtaggtc 8100 
cagtttgacc tgctggctga aggtaaatgc 8160 
tcgcttcctt atttgcggcg atgattgcac 8220 
agacaaacaa gcaatgcgtg tctttgctag 8280 
ttgtgtgcct caacccaaat acagtttgga 8340 
ctctggaatt accaaaagtg gcaagcctta 8400 
ccttggcagg tgctctgccg agggtctggg 8460 
tctaatacat cactacccat gtttgtgggt 8520 
gcagatgctc tttgaggaca aacttcccga 8580 
ttatacggtg cctgtagaag atctgcccag 864 0 
tttctcggtg gtgcgctaca ccaacgctga 8700 
catgaccatg ccccccctgc gagcctggcg 8760 
caagaggcgt ggcggagcac acgcaaaatt 8820 
tagacctcta ccagatttgg ataagacgag 8B80 
tgatgtttac tccccggagg gggatgtgtt 8940 
ccttgtgaag tatttggctg tcattgtttt 9000 
catcagctga acccccaaat tcaaaattaa 9060 
agggcagcgg caacagggga gaccccgggc 9120 

9139 



<210> 4 
<211> 9711 
<212> DNA 

<213> Hepatitis C virus 
<400> 4 

acccgcccct aataggggcg acactccgcc 
cttcacgcag aaagcgtcta gccatggcgt 
ccccctcccg ggagagccat agtggtctgc 
aagactgggt cctttcttgg ataaacccac 
caagactgct agccgagtag cgttgggttg 
tgcttgcgag tgccccggga ggtctcgtag 
tcaaagaaaa accaaaagaa acaccaaccg 
cggccagatc gttggcggag tatacttgtt 
cgcgacaagg aagacttcgg agcggtccca 
agatcggcgc tccactggca aatcctgggg 
gaatgaggga ctcggctggg caggatggct 
gggccccaat gacccccggc ataggtcgcg 
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atgaatcact cccctgtgag gaactactgt 60 
tagtatgagt gtcgtacagc ctccaggccc 120 
ggaaccggtg agtacaccgg aattgccggg 180 
tctatgcccg gccatttggg cgtgcccccg 240 
cgaaaggcct tgtggtactg cctgataggg 300 
accgtgcacc atgagcacaa atcctaaacc 360 
tcgcccacaa gacgttaagt ttccgggcgg 420 
gccgcgcagg ggccccaggt tgggtgtgcg 480 
gccacgtgga aggcgccagc ccatccctaa 540 
aaaaccagga tacccctggc ccctatacgg 600 
cctgtccccc cgaggttccc gtccctcttg 660 
caacgtgggt aaggtcatcg ataccctaac 720 
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gtgcggcttt gccgacctca tggggtacat ccctgtcgtg ggcgccccgc tcggcggcgt 780 
cgccagagct ctcgcgcatg gcgtgagagt cctggaggac ggggttaatt ttgcaacagg 840 
gaacttaccc ggttgctcct tttctatctt cttgctggcc ctgctgtcct gcatcaccac 900 
cccggtctcc gctgccgaag tgaagaacat cagtaccggc tacatggtga ctaacgactg 960 
caccaatgac agcattacct ggcagctcca ggctgctgtc ctccacgtcc ccgggtgcgt 1020 
cccgtgcgag aaagtgggga atgcatctca gtgctggata ccggtctcac cgaatgtggc 1080 
cgtgcagcgg cccggcgccc tcacgcaggg cttgcggacg cacatcgaca tggttgtgat 1140 
gtccgccacg ctctgctctg ccctctacgt gggggacctc tgcggtgggg tgatgctcgc 1200 
agcccaaatg ttcattgtct cgccgcagca ccactggttt gtccaagact gcaattgctc 1260 
catctaccct ggtaccatca ctggacaccg catggcatgg gacatgatga tgaactggtc 1320 
gcccacggct accatgatct tggcgtacgc gatgcgtgtc cccgaggtca ttatagacat 1380 
cattagcggg gctcattggg gcgtcatgtt cggcttggcc tacttctcta tgcagggagc 1440 
gtgggcgaaa gtcgttgtca tccttctgtt ggccgccggg gtggacgcgc gcacccatac 1500 
tgttgggggt tctgccgcgc agaccaccgg gcgcctcacc agctcatttg acatgggccc 1560 
caggcagaaa atccagctcg ttaacaccaa tggcagctgg cacatcaacc gcaccgccct 1620 
gaactgcaat gactccttgc acaccggctt tatcgcgtct ctgttctaca cccacagctt 1680 
caactcgtca ggatgtcccg aacgcatgtc cgcctgccgc agtatcgagg ccttccgggt 1740 
gggatggggc gccttgcaat atgaggataa tgtcaccaat ccagaggata tgagacccta 1800 
ttgctggcac tacccaccaa ggcagtgtgg cgtggtctcc gcgaagactg tgtgtggccc 1860 
agtgtactgt ttcaccccca gcccagtggt agtgggcacg accgacaggc ttggagcgcc 1920 
cacttacacg tggggggaga atgagacaga tgtcttccta ttgaacagca ctcgaccacc 1980 
gctggggtca tggttcggct gcacgtggat gaactcttct ggctacacca agacttgcgg 2040 
cgcaccaccc tgccgtacta gagctgactt caacgccagc acggacctgt tgtgccccac 2100 
ggactgtttt aggaagcatc ctgataccac ttacctcaaa tgcggctctg ggccctggct 2160 
cacgccaagg tgcctgatcg actaccccta caggctctgg cattacccct gcacagttaa 2220 
ctataccatc ttcaaaataa ggatgtatgt gggaggggtt gagcacaggc tcacggctgc 2280 
atgcaatttc actcgtgggg atcgttgcaa cttggaggac agagacagaa gtcaactgtc 2340 
tcctttgttg cactccacca cggaatgggc cattttacct tgctcttact cggacctgcc 2400 
cgccttgtcg actggtcttc tccacctcca ccaaaacatc gtggacgtac aattcatgta 2460 
tggcctatca cctgccctca caaaatacat cgtccgatgg gagtgggtaa tactcttatt 2520 
cctgctctta gcggacgcca gggtttgcgc ctgcttatgg atgctcatct tgttgggcca 2580 
ggccgaagca gcactagaga agctggtcat cttgcacgct gcgagcgcag ctagctgcaa 2640 
tggcttccta tattttgtca tctttttcgt ggctgcttgg tacatcaagg gtcgggtagt 2700 
ccccttagct acctattccc tcactggcct gtggtccttt agcctactgc tcctagcatt 2760 
gccccaacag gcttatgctt atgacgcatc tgtgcatggc cagataggag cggctctgct 2820 
ggtaatgatc actctcttta ctctcacccc cgggtataag acccttctca gccggttttt 2880 
gtggtggttg tgctatcttc tgaccctggg ggaagctatg gtccaggagt gggcaccacc 2940 
tatgcaggtg cgcggtggcc gtgatggcat catatgggcc gtcgccatat tctacccagg 3000 
tgtggtgttt gacataacca agtggctctt ggcggtgctt gggcctgctt acctcctaaa 3060 
aggtgctttg acgcgcgtgc cgtacttcgt cagggctcac gctctactga ggatgtgcac 3120 
catggcaagg catctcgcgg ggggcaggta cgtccagatg gcgctactag cccttggcag 3180 
gtggactggc acttacatct atgaccacct cacccctatg tcggattggg ctgctagtgg 3240 
cctgcgggac ctggcggtcg ccgctgagcc tatcatcttc agtccgatgg agaagaaagt 3300 
cattgtctgg ggagcggaga cagctgcttg tggggacatt ttacacggac ttcccgtgtc 3360 
cgcccgactt ggtcgggagg tcctccttgg cccagctgat ggctatacct ccaaggggtg 3420 
gagtcttctc gcccccatca ctgcttacgc ccagcagaca cgtggccttt tgggcaccat 3480 
agtggtgagc atgacggggc gcgacaagac agaacaggct ggggaaattc aggtcctgtc 3540 
cacagtcact cagtccttcc tcggaacatc catctcgggg gttttgtgga ctgtctacca 3600 
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tggagctggc aacaagactc tggccggctc acggggtccg gtcacgcaga tgtactccag 3660 
tgctgagggg gacttagtag ggtggcccag cccccctggg actaaatctt tggagccgtg 3720 
cacgtgtgga gcggtcgacc tgtacctggt cacgcggaac gctgatgtca tcccggctcg 3780 
aagacgcggg gacaaacggg gagcgctact ctccccgaga cctctttcca ccttgaaggg 3 840 
gtcctcagga ggcccggtgc tatgccccag gggccacgct gtcggagtct tccgggcagc 3900 
tgtgtgctct cggggcgtgg ctaagtccat agatttcatc cccgttgaga cactcgacat 3960 
cgtcacgcgg tcccccacct ttagtgacaa cagcacacca cctgctgtgc cccagaccta 4020 
tcaggtcggg tacttgcatg ccccgactgg cagtggaaag agcaccaaag ttcctgtcgc 4080 
atatgctgct caggggtata aagtgctagt gcttaatccc tcagtggctg ccaccctggg 4140 
gtttggggcg tacttgtcta aggcacatgg catcaatccc aacattagga ctggagtcag 4200 
gactgtgacg accggggcgc ccatcacgta ctccacatat ggcaaattcc Ccgccgatgg 4260 
gggctgtgcg ggcggcgcct acgacatcat catatgtgat gaatgccatg ccgtggactc 4320 
taccaccatc cttggcatcg gaacagtcct tgatcaagca gagacagctg gggtcagact 4380 
aactgtgctg gctacagcta cgccccctgg gtcagtgaca accccccacc ccaacataga 4440 
ggaggtggcc cttgggcagg agggcgagat ccccttctat gggagggcga ttcccctgtc 4500 
ttacatcaag ggaggaagac atctgatctt ctgccattca aagaaaaagt gtgacgagct 4560 
cgcggcggcc cttcggggta tgggcttgaa ctcagtggca tactacagag ggttggacgt 4620 
ctccgtaata ccaactcagg gagacgtagt ggtcgtcgcc accgacgccc tcatgacagg 4680 
gtatactggg gactttgact ccgtgatcga ctgcaacgta gcggtcactc aagttgtaga 4740 
cttcagttta gaccccacat tcaccataac cacacagatt gtccctcaag acgctgtctc 4800 
acgtagccag cgccggggtc gcacgggtag gggaagactg ggcatttata ggtatgtttc 4860 
cactggtgag cgagcctcag gaatgtttga cagtgtagtg ctctgtgagt gctacgacgc 4 920 
aggggccgca tggtatgagc tcacaccatc ggagaccacc gtcaggctca gggcgtattt 4980 
caacacgccc ggtttgcctg tgtgccaaga ccatcttgag ttttgggagg cagttttcac 5040 
cggcctcaca cacatagatg cccacttcct ttcccaaaca aagcaatcgg gggaaaattt 5100 
cgcatactta acagcctacc aggctacagt gtgcgctagg gccaaagccc cccccccgtc 5160 
ctgggacgtc atgtggaagt gtttgactcg actcaagccc acactcgtgg gccccacacc 5220 
tctcctgtac cgcttgggct ctgttaccaa cgaggtcacc ctcacacatc ccgtgacgaa 5280 
atacatcgcc acctgcatgc aagccgacct tgaggtcatg accagcacat gggtcttggc 5340 
agggggagtc ttggcggccg tcgccgcgta ttgcctggcg accgggtgtg tttgcatcat 5400 
cggccgcttg cacattaacc agcgagccgt cgttgcgccg gacaaggagg tcctctatga 5460 
ggcttttgat gagatggagg aatgtgcctc tagggcggct ctcattgaag aggggcagcg 5520 
gatagccgag atgctgaagt ccaagatcca aggcttattg cagcaagctt ccaaacaagc 5580 
tcaagacata caacccactg tgcaggcttc atggcccaag gtagaacaat tctgggccaa 5640 
acacatgtgg aacttcatta gcggcatcca atacctcgca ggactatcaa cactgccagg 5700 
gaaccctgca gtagcttcca tgatggcgtt cagtgccgcc ctcaccagtc cgctgtcaac 5760 
aagcaccact atccttctca acattttggg gggctggcta gcatcccaaa ttgcaccacc 5820 
cgcgggggcc actggcttcg ttgtcagtgg cctagtggga gctgccgtag gcagtatagg 5880 
cttaggtaag gtgctagtgg acatcctggc agggtatggt gcgggcattt cgggggctct 5940 
cgtcgcattc aagatcatgt ctggcgagaa gccctccatg gaggatgtcg tcaacttgct 6000 
gcctggaatt ctgtctccgg gtgccttggt agtgggagtc atctgcgcgg ccattctgcg 6060 
ccgacacgtg ggaccggggg aaggcgccgt ccaatggatg aatagactca ttgcctttgc 6120 
ttccagagga aatcacgtcg cccccaccca ctacgtgacg gagtcggatg cgtcgcagcg 6180 
tgtgacccaa ctacttggct cccttaccat aaccagcctg ctcagaagac tccacaactg 6240 
gattactgag gactgcccca tcccatgcgg cggctcgtgg ctccgcgatg tgtgggactg 6300 
ggtttgcacc atcctaacag actttaaaaa ttggctgacc tccaaattat tcccaaagat 6360 
gcccggcctc ccctttgtct cctgtcaaaa ggggtacaag ggcgtgtggg ccggcactgg 6420 
catcatgacc acacggtgtc cttgcggcgc caatatctct ggcaatgtcc gcttgggctc 6480 
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catgagaatc acggggccta agacctgcat 
ttgttacacg gagggccagt gcgtgccgaa 
gagggtggcg gcctcagagt acgcggaggt 
aggactcacc actgataact tgaaagtccc 
ctgggtggac ggagtgcaga tccataggtt 
tgaggtctcg ttctgcgttg ggcttaattc 
ccctgaaccc gacacagacg tattgatgtc 
ggagactgca gcgcggcgtt tagcgcgggg 
gagccagcta tcggcaccat cgctgcgagc 
tgtggacatg gtggatgcta acctgttcat 
gtccaaagtg gtcgttctgg actctctcga 
gccttcgata ccatcagaat acatgctccc 
ctgggcacgg cctgattaca acccaccgct 
accggccact gttgcgggct gtgctctccc 
aaggaggcgc cggacagtgg gcctaagtga 
ggccattaag tcctttggcc agcccccccc 
gggcgctgcc gattccggca gtcagacgcc 
ttccatctct tccatgcccc ccctcgaggg 
gcaggtagag ccccaacccc ccccccaggg 
gtcctggtct acttgctccg aggaggacga 
ctggaccggg gctctaataa ctccttgtag 
cttgagcaac tccctgttgc gatatcacaa 
ctcactaagg gctaaaaagg taacttttga 
ctcagtctta aaggacatta agctagcggc 
ggaggaggct tgccagttaa ccccacccca 
taaggaggtc cgcagcttgt ccgggagggc 
cctcctggag gactcagaaa caccaattcc 
ctgcgtggac cccaccaagg ggggcaagaa 
cggcgtcagg gtctgcgaga agatggccct 
ggtgatgggg gcttcttatg gattccagta 
gaaagcatgg gcggaaaaga aggaccctat 
ctcaaccgtc actgagagag acatcaggac 
gcccgaggag gcccacactg ccatacactc 
tatgttcaac agcaagggcc aaacctgcgg 
caccactagc atggggaaca ccatcacatg 
tgcagggata atcgcgccca caatgctggt 
aagccagggg accgaggagg acgagcggaa 
gtattctgcc cctcctggtg acccccccag 
ttgctcctca aatgtgtctg tggcgctggg 
cagagaccct accactccaa tcgcccgggc 
caattcatgg ctgggaaaca tcatccagta 
gatgacacac ttcttctcca ttctcatggc 
tgagatgtac ggagcggtgt actccgtgag 
gttacatggg cttgacgctt tttctctgca 
ggcttcagcc ctcagaaaac ttggggcgcc 
tgcagtcagg gcgtccctca tctcccgtgg 
cttcaattgg gcggtgaaga ccaagctcaa 
ggatttatcc agctggttca ccgtcggcgc 



gaatatctgg caggggacct ttcctatcaa 654 0 
acccgcgcca aactttaagg tcgccatctg 6600 
gacgcagcac gggtcatacc actacataac 6660 
ctgccaacta ccctctcccg agttcttttc 6720 
tgcccccaca ccgaagccgt ttttccggga 6780 
atttgtcgtc gggtcccagc ttccttgcga 6840 
catgctaaca gatccatctc atatcacggc 6900 
gtcaccccca tccgaggcaa gctcctcggc 6960 
cacctgcacc acccacggca aagcctatga 7020 
ggggggcgat gtgactcgga tagagtctgg 7080 
cccaatggtc gaagaaagga gcgaccttga 7140 
caagaagagg ttcccaccag ctttaccggc 7200 
tgtggaatcg tggaaaaggc cagattacca 7260 
tcctcctagg aaaaccccga cgcctccccc 732 0 
ggactccata ggagatgccc ttcaacagct 7380 
aagcggcgat tcaggccttt ccacgggggc 7440 
tcctgatgag ttggcccttt cggagacagg 7500 
ggagcttgga gatccagacc tggagcctga 7560 
gggggtggca gctcccggct cggactcggg 7620 
ctccgtcgtg tgctgctcca tgtcatactc 7680 
tcccgaagag gagaagttac cgattaaccc 7740 
caaggtgtac tgtaccacaa caaagagcgc 7800 
taggatgcaa gtgctcgact cctactacga 7860 
ctccaaggtc accgcaaggc tcctcaccat 7920 
ttctgcaaga tctaaatatg ggtttggggc 7980 
cgttaaccac atcaagtccg tgtggaagga 6040 
cacaaccatt atggccaaaa atgaggtgtt 8100 
agcagctcgc cttatcgttt accctgacct 8160 
ttatgacatt acacaaaaac ttcctcaggc 8220 
ttcccccgct cagcgggtag agtttctctt 8260 
gggtttttcg tatgataccc gatgctttga 8340 
tgaggagtcc atatatcggg cctgctcctt 8400 
gctaactgag agactttacg tgggagggcc 8460 
gtacaggcgt tgccgcgcca gcggggtgct 8520 
ctacgtgaaa gccttagcgg cttgtaaagc 8580 
atgcggcgat gacttggttg tcatctcaga 8640 
cctgagagcc ttcacggagg ctatgaccag 8700 
accggagtat gatctggagc tgataacatc 8760 
cccacaaggc cgccgcagat actacctgac 8820 
tgcctgggaa acagttagac actcccctgt 8880 
cgccccgacc atatgggctc gcatggtcct 8940 
tcaagacacg ctggaccaga acctcaactt 9000 
tcccttggac ctcccagcta taattgaaag 9060 
cacatacact ccccacgaac tgacacgggt 9120 
acccctcaga gcgtggaaga gccgggcacg 9180 
ggggagagcg gccgtttgcg gtcgatatct 9240 
actcactcca ttgccggaag cgcgcctcct 9300 
cggcgggggc gacatttatc acagcgtgtc 9360 
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gcgtgcccga ccccgcttat tgctctttgg 
tttcctactc cccgctcggt agagcggcac 
cttttttttt tttttttttt tttttttttt 
tttttttttt tttttttttt tttttctttt 
tactttcttt cctggtggct ccatcttagc 
gagccgcatg actgcagaga gtgccgtaac 



cctactccta ctttttgtag gggtaggcct 9420 
acattagcta cactccatag ctaactgtcc 9480 
tttttttttt tttttttttt tttttttttt 9540 
tttctctttt ccttctttct taccttattt 9600 
cctagtcacg gctagctgtg aaaggtccgt 9660 
tggtctctct gcagatcatg t 9711 



<210> 5 
<2H> 3033 
<212> PRT 

<2ia> Hepatitis C virus 
<400> 5 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
1 5 10 15 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin He Val Gly 
20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 60 

He Pro Lys Asp Arg Arg Ser Thr Gly Lys Ser Trp Gly Lys Pro Gly 
65 70 75 80 

Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Leu Gly Trp Ala Gly Trp 
65 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Asn Asp Pro 
100 105 110 

Arg His Arg Ser Arg Asn Val Gly Lys Val He Asp Thr Leu Thr Cys 
115 120 125 

Gly Phe Ala Asp Leu Met Gly Tyr He Pro Val Val Gly Ala Pro Leu 
130 135 140 

Gly Gly Val Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Phe Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He 
165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys He Thr Thr Pro Val Ser Ala Ala 
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180 185 190 

Glu Val Lys Asn lie Ser Thr Gly Tyr Met Val Thr Asn Asp Cys Thr 
195 200 205 

Asn Asp Ser He Thr Trp Gin Leu Gin Ala Ala Val Leu His Val Pro 
210 215 220 

Gly Cys Val Pro Cys Glu Lys Val Gly Asn Ala Ser Gin Cys Trp He 
225 230 235 240 

Pro Val Ser Pro Asn Val Ala Val Gin Arg Pro Gly Ala Leu Thr Gin 
245 250 255 

Gly Leu Arg Thr His He Asp Met Val Val Met Ser Ala Thr Leu Cys 
260 265 270 

Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Gly Val Met Leu Ala Ala 
275 280 285 

Gin Met Phe He Val Ser Pro Gin His His Trp Phe Val Gin Asp Cys 
290 295 300 

Asn Cys Ser He Tyr Pro Gly Thr He Thr Gly His Arg Met Ala Trp ... 
305 310 315 320 

Asp Net Met Met Asn Trp Ser Pro Thr Ala Thr Met He Leu Ala Tyr * 
325 330 335 

Ala Met Arg Val Pro Glu Val He He Asp He He Ser Gly Ala His 
340 345 350 

Trp Gly Val Met Phe Gly Leu Ala Tyr Phe Ser Met Gin Gly Ala Trp 
355 360 365 

Ala Lys Val Val Val He Leu Leu Leu Ala Ala Gly Val Asp Ala Arg 
370 375 380 

Thr His Thr Val Gly Gly Ser Ala Ala Gin Thr Thr Gly Arg Leu Thr 
385 390 395 400 

Ser Leu Phe Asp Met Gly Pro Arg Gin Lys He Gin Leu Val Asn Thr 
405 410 415 

Asn Gly Ser Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser 
420 425 430 

Leu His Thr Gly Phe He Ala Ser Leu Phe Tyr Thr His Ser Phe Asn 
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435 440 445 

Ser Ser Gly Cys Pro Glu Arg Met Ser Ala Cys Arg Ser lie Glu Ala 
450 455 460 

Phe Arg Val Gly Trp Gly Ala Leu Gin Tyr Glu Asp Asn Val Thr Asn 
465 470 475 480 

Pro Glu Asp Met Arg Pro Tyr Cys Trp His Tyr Pro Pro Arg Gin Cys 
485 490 495 

Gly Val Val Ser Ala Lys Thr Val Cys Gly Pro Val Tyr Cys Phe Thr 
500 505 510 

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Leu Gly Ala Pro Thr 
515 520 525 

Tyr Thr Trp Gly Glu Asn Glu Thr Asp Val Phe Leu Leu Asn Ser Thr 
530 535 540 

Arg Pro Pro Leu Gly Ser Trp Phe Gly Cys Thr Trp Met Asn Ser Ser 
545 550 555 560 

Gly Tyr Thr Lys Thr Cys Gly Ala Pro Pro Cys Arg Thr Arg Ala Asp 
565 570 575 

Phe Asn Ala Ser Thr Asp Leu Leu Cys Pro Thr Asp Cys Phe Arg Lys 
580 585 590 

His Pro Asp Thr Thr Tyr Leu Lys Cys Gly Ser Gly Pro Trp Leu Thr 
595 600 605 

Pro Arg Cys Leu He Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys 
610 615 620 

Thr Val Asn Tyr Thr He Phe Lys He Arg Met Tyr Val Gly Gly Val 
625 630 635 640 

Glu His Arg Leu Thr Ala Ala Cys Asn Phe Thr Arg Gly Asp Arg Cys 
645 650 655 

Asn Leu Glu Asp Arg Asp Arg Ser Gin Leu Ser Pro Leu Leu His Ser 
660 665 670 

Thr Thr Glu Trp Ala He Leu Pro Cys Ser Tyr Ser Asp Leu Pro Ala 
675 680 685 

Leu Ser Thr Gly Leu Leu His Leu His Gin Asn He Val Asp Val Gin 
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COO 



700 



Phe Met Tyr Gly Leu Ser Pro Ala Leu Thr Lys Tyr He Val Arg Trp 
705 710 715 720 

Glu Trp Val He Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys 
725 730 735 

Ala Cys Leu Trp Met Leu lie Leu Leu Gly Gin Ala Glu Ala Ala Leu 
740 745 750 

Glu Lys Leu Val He Leu His Ala Ala Ser Ala Ala Ser Cys Asn Gly 
755 760 765 

Phe Leu Tyr Phe Val He Phe Phe Val Ala Ala Trp Tyr He Lys Gly 
770 775 780 

Arg Val Val Pro Leu Ala Thr Tyr Ser Leu Thr Gly Leu Trp Ser Phe 
785 790 795 800 

Ser Leu Leu Leu Leu Ala Leu Pro Gin Gin Ala Tyr Ala Tyr Asp Ala 
805 810 815 

Ser Val His Gly Gin lie Gly Ala Ala Leu Leu Val Met He Thr Leu 
820 825 830 

Phe Thr Leu Thr Pro Gly Tyr Lys Thr Leu Leu Ser Arg Phe Leu Trp 
835 840 845 

Trp Leu Cys Tyr Leu Leu Thr Leu Gly Glu Ala Met Val Gin Glu Trp 
850 855 860 

Ala Pro Pro Met Gin Val Arg Gly Gly Arg Asp Gly He He Trp Ala 
865 870 875 880 

Val Ala He Phe Tyr Pro Gly Val Val Phe Asp He Thr Lys Trp Leu 
885 890 895 

Leu Ala Val Leu Gly Pro Ala Tyr Leu Leu Lys Gly Ala Leu Thr Arg 
900 905 910 

Val Pro Tyr Phe Val Arg Ala His Ala Leu Leu Arg Met Cys Thr Met 
915 920 925 

Ala Arg His Leu Ala Gly Gly Arg Tyr Val Gin Met Ala Leu Leu Ala 
930 935 940 

Leu Gly Arg Trp Thr Gly Thr Tyr He Tyr Asp His Leu Thr Pro Met 
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^43 .960 

Ser Asp Trp Ala Ala Ser Gly Leu Arg Asp Leu Ala Val Ala Val Glu 
965 970 975 

Pro lie lie Phe Ser Pro Met Glu Lys Lys Val lie Val Trp Gly Ala 
980 985 990 

Glu Thr Ala Ala Cys Gly Asp lie Leu His Gly Leu Pro Val Ser Ala 
995 1000 1005 

Arg Leu Gly Arg Glu Val Leu Leu Gly Pro Ala Asp Gly Tyr Thr Ser 
1010 1015 1020 

Lys Gly Trp Ser Leu Leu Ala Pro He Thr Ala Tyr Ala Gin Gin Thr 
1025 1030 1035 1040 

Arg Gly Leu Leu Gly Thr He Val Val Ser Met Thr Gly Arg Asp Lys 
1045 1050 1055 

Thr Glu Gin Ala Gly Glu He Gin Val Leu Ser Thr Val Thr Gin Ser 
1060 1065 1070 

Phe Leu Gly Thr Ser He Ser Gly Val Leu Trp Thr Val Tyr His Gly 
1075 1080 1085 

Ala Gly Asn Lys Thr Leu Ala Gly Ser Arg Gly Pro Val Thr Gin Met 
1090 1095 1100 

Tyr Ser Ser Ala Glu Gly Asp Leu Val Gly Trp Pro Ser Pro Pro Gly 
1105 1110 1115 1120 

Thr Lys Ser Leu Glu Pro Cys Thr Cys Gly Ala Val Asp Leu Tyr Leu 
1125 1130 1135 

Val Thr Arg Asn Ala Asp Val He Pro Ala Arg Arg Arg Gly Asp Lys 
1140 1145 1150 

Arg Gly Ala Leu Leu Ser Pro Arg Pro Leu Ser Thr Leu Lys Gly Ser 
1155 1160 1165 

Ser Gly Gly Pro Val Leu Cys Pro Arg Gly His Ala Val Gly Val Phe 
1170 1175 1180 

Arg Ala Ala Val Cys Ser Arg Gly Val Ala Lys Ser He Asp Phe He 
1185 1190 1195 1200 

Pro Val Glu Thr Leu Asp He Val Thr Arg Ser Pro Thr Phe Ser Asp 
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Asn Ser Thr Pro Pro Ala Val Pro Gin Thr Tyr Gin Val Gly Tyr Leu 
1220 1225 1230 

His Ala Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Val Ala Tyr 
1235 1240 1245 

Ala Ala Gin Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala 
1250 1255 1260 

Thr Leu Gly Phe Gly Ala Tyr Leu Ser Lys Ala His Gly lie Asn Pro 
1265 1270 1275 1280 

Asn lie Arg Thr Gly Val Arg Thr Val Thr Thr Gly Ala Pro He Thr 
1285 1290 1295 

Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys Ala Gly Gly 
1300 1305 1310 

Ala Tyr Asp He He He Cys Asp Glu Cys His Ala Val Asp Ser Thr 
1315 1320 1325 

Thr He Leu Gly He Gly Thr Val Leu Asp Gin Ala Glu Thr Ala Gly 
1330 1335 1340 

Val Arg Leu Thr Val Leu Ala Thr Ala Thr Pro Pro Gly Ser Val Thr 
1345 1350 1355 1360 

Thr Pro His Pro Asn He Glu Glu Val Ala Leu Gly Gin Glu Gly Glu 
1365 1370 1375 

He Pro Phe Tyr Gly Arg Ala He Pro Leu Ser Tyr He Lys Gly Gly 
1380 1385 1390 

Arg His Leu He Phe Cys His Ser Lys Lys Lys Cys Asp Glu Leu Ala 
1395 1400 1405 

Ala Ala Leu Arg Gly Met Gly Leu Asn Ser Val Ala Tyr Tyr Arg Gly 
1410 1415 1420 

Leu Asp Val Ser Val He Pro Thr Gin Gly Asp Val Val Val Val Ala 
1425 1430 1435 1440 

Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val He 
1445 1450 1455 

Asp Cys Asn Val Ala Val Thr Gin Val Val Asp Phe Ser Leu Asp Pro 
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Thr Phe Thr lie Thr Thr Gin He Val Pro Gin Asp Ala Val Ser Arg 
1475 1480 1485 

Ser Gin Arg Arg Gly Arg Thr Gly Arg Gly Arg Leu Gly He Tyr Arg 
1490 1495 1500 

Tyr Val Ser Thr Gly Glu Arg Ala Ser Gly Met Phe Asp Ser Val Val 
1505 1510 1515 1520 

Leu Cys Glu Cys Tyr Asp Ala Gly Ala Ala Trp Tyr Glu Leu Thr Pro 
1525 1530 1535 

Ser Glu Thr Thr Val Arg Leu Arg Ala Tyr Phe Asn Thr Pro Gly Leu 
1540 1545 1550 

Pro Val Cys Gin Asp His Leu Glu Phe Trp Glu Ala Val Phe Thr Gly 
1555 1560 1565 

Leu Thr His He Asp Ala His Phe Leu Ser Gin Thr Lys Gin Ser Gly 
1570 1575 1580 

Glu Asn Phe Ala Tyr Leu Thr Ala Tyr Gin Ala Thr Val Cys Ala Arg 
1585 1590 1595 1600 ' 

Ala Lys Ala Pro Pro Pro Ser Trp Asp Val Met Trp Lys Cys Leu Thr 
1605 1610 1615 

Arg Leu Lys Pro Thr Leu Val Gly Pro Thr Pro Leu Leu Tyr Arg Leu 
1620 1625 1630 

Gly Ser Val Thr Asn Glu Val Thr Leu Thr His Pro Val Thr Lys Tyr 
1635 1640 1645 

He Ala Thr Cys Met Gin Ala Asp Leu Glu Val Met Thr Ser Thr Trp 
1650 1655 1660 

Val Leu Ala Gly Gly Val Leu Ala Ala Val Ala Ala Tyr Cys Leu Ala 
1665 1670 1675 1680 

Thr Gly Cys Val Cys He He Gly Arg Leu His He Asn Gin Arg Ala 
1685 1690 1695 

Val Val Ala Pro Asp Lys Glu Val Leu Tyr Glu Ala Phe Asp Glu Met 
1700 1705 1710 

Glu Glu Cys Ala Ser Arg Ala Ala Leu He Glu Glu Gly Gin Arg He 
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1715 1720 1725 

Ala Glu Met beu hye Ser Lys lie Gin Gly Leu Leu Gin Gin Ala Ser 
1730 1735 1740 

Lys Gin Ala Gin Asp lie Gin Pro Thr Val Gin Ala Ser Trp Pro Lys 
1745 1750 1755 1760 

Val Glu Gin Phe Trp Ala Lys His Met Trp Asn Phe He Ser Gly He 
1765 1770 1775 

Gin Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro Ala Val Ala 
1780 1785 1790 

Ser Met Met Ala Phe Ser Ala Ala Leu Thr Ser Pro Leu Ser Thr Ser 
1795 1800 1805 

Thr Thr He Leu Leu Asn He Leu Gly Gly Trp Leu Ala Ser Gin He 
1810 1815 1820 

Ala Pro Pro Ala Gly Ala Thr Gly Phe Val Val Ser Gly Leu Val Gly 
1825 1830 1835 1840 

Ala Ala Val Gly Ser He Gly Leu Gly Lys Val Leu Val Asp He Leu 
1845 1850 1855 

Ala Gly Tyr Gly Ala Gly He Ser Gly Ala Leu Val Ala Phe Lys He 
1860 1865 1870 

Met Ser Gly Glu Lys Pro Ser Met Glu Asp Val Val Asn Leu Leu Pro 
1875 1880 1885 

Gly He Leu Ser Pro Gly Ala Leu Val Val Gly Val He Cys Ala Ala 
1890 1895 1900 

He Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala Val Gin Trp Met 
1905 1910 1915 1920 

Asn Arg Leu He Ala Phe Ala Ser Arg Gly Asn His Val Ala Pro Thr 
1925 1930 1935 

His Tyr Val Thr Glu Ser Asp Ala Ser Gin Arg Val Thr Gin Leu Leu 
1940 1945 1950 

Gly Ser Leu Thr He Thr Ser Leu Leu Arg Arg Leu His Asn Trp He 
1955 1960 1965 

Thr Glu Asp Cys Pro He Pro Cys Gly Gly Ser Trp Leu Arg Asp Val 
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Trp Asp Trp Val Cys Thr He Leu Thr Asp Phe Lys Asn Trp Leu Thr 
1985 1990 1995 2000 

Ser Lys Leu Phe Pro Lys Met Pro Gly Leu Pro Phe Val Ser Cys Gin 
2005 2010 2015 

Lys Gly Tyr Lys Gly Val Trp Ala Gly Thr Gly He Met Thr Thr Arg 
2020 2025 2030 

Cys Pro Cys Gly Ala Asn He Ser Gly Asn Val Arg Leu Gly Ser Met 
2035 2040 2045 

Arg He Thr Gly Pro Lys Thr Cys Met Asn He Trp Gin Gly Thr Phe 
2050 2055 2060 

Pro He Asn Cys Tyr Thr Glu Gly Gin Cys Val Pro Lys Pro Ala Pro 
2065 2070 2075 20B0 

Asn Phe Lys Val Ala He Trp Arg Val Ala Ala Ser Glu Tyr Ala Glu 
2085 2090 2095 

Val Thr Gin His Gly Ser Tyr His Tyr He Thr Gly Leu Thr Thr Asp 
2100 2105 2110 

Asn Leu Lys Val Pro Cys Gin Leu Pro Ser Pro Glu Phe Phe Ser Trp 
2115 2120 2125 

Val Asp Gly Val Gin He His Arg Phe Ala Pro Thr Pro Lys Pro Phe 
2130 2135 2140 

Phe Arg Asp Glu Val Ser Phe Cys Val Gly Leu Asn Ser Phe Val Val 
2145 2150 2155 2160 

Gly Ser Gin Leu Pro Cys Asp Pro Glu Pro Asp Thr Asp Val Leu Met 
2165 2170 2175 

Ser Met Leu Thr Asp Pro Ser His He Thr Ala Glu Thr Ala Ala Arg 
2180 2185 2190 

Arg Leu Ala Arg Gly Ser Pro Pro Ser Glu Ala Ser Ser Ser Ala Ser 
2195 2200 2205 

Gin Leu Ser Ala Pro Ser Leu Arg Ala Thr Cys Thr Thr His Gly Lys 
2210 2215 2220 

Ala Tyr Asp Val Asp Met Val Asp Ala Asn Leu Phe Met Gly Gly Asp 
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2240 



Val Thr Arg lie Glu Ser Gly Ser Lys Val Val Val Leu Asp Ser Leu 
2245 2250 2255 

Asp Pro Met Val Glu Glu Arg Ser Asp Leu Glu Pro Ser He Pro Ser 
2260 2265 2270 

Glu Tyr Met Leu Pro Lys Lys Arg Phe Pro Pro Ala Leu Pro Ala Trp 
2275 2280 2285 

Ala Arg Pro Asp Tyr Asn Pro Pro Leu Val Glu Ser Trp Lys Arg Pro 
2290 2295 2300 

Asp Tyr Gin Pro Ala Thr Val Ala Gly Cys Ala Leu Pro Pro Pro Arg 
2305 2310 2315 2320 

Lys Thr Pro Thr Pro Pro Pro Arg Arg Arg Arg Thr Val Gly Leu Ser 
2325 2330 2335 

Glu Asp Ser He Gly Asp Ala Leu Gin Gin Leu Ala He Lys Ser Phe 
2340 2345 2350 

Gly Gin Pro Pro Pro Ser Gly Asp Ser Gly Leu Ser Thr Gly Ala Gly 
2355 2360 2365 

Ala Ala Asp Ser Gly Ser Gin Thr Pro Pro Asp Glu Leu Ala Leu Ser 
2370 2375 2380 

Glu Thr Gly Ser He Ser Ser Met Pro Pro Leu Glu Gly Glu Leu Gly 
2385 2390 2395 2400 

Asp Pro Asp Leu Glu Pro Glu Gin Val Glu Pro Gin Pro Pro Pro Gin 
2405 2410 2415 

Gly Gly Val Ala Ala Pro Gly Ser Asp Ser Gly Ser Trp Ser Thr Cys 
2420 2425 2430 

Ser Glu Glu Asp Asp Ser Val Val Cys Cys Ser Met Ser Tyr Ser Trp 
2435 2440 2445 

Thr Gly Ala Leu He Thr Pro Cys Ser Pro Glu Glu Glu Lys Leu Pro 
2450 2455 2460 

He Asn Pro Leu Ser Asn Ser Leu Leu Arg Tyr His Asn Lys Val Tyr 
2465 2470 2475 2480 

Cys Thr Thr Thr Lys Ser Ala Ser Leu Arg Ala Lys Lys val Thr Phe 
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24B5 2490 2495 

Asp Arg Met Gin Val Leu Asp Ser Tyr Tyr Asp Ser Val Leu Lys Asp 
2500 2505 2510 

lie Lys Leu Ala Ala Ser Lys Val Thr Ala Arg Leu Leu Thr Met Glu 
2515 2520 2525 

Glu Ala Cys Gin Leu Thr Pro Pro His Ser Ala Arg Ser Lys Tyr Gly 
2530 2535 2540 

Phe Gly Ala Lys Glu Val Arg Ser Leu Ser Gly Arg Ala Val Asn His 
2545 2550 2555 2560 

lie Lys Ser Val Trp Lys Asp Leu Leu Glu Asp Ser Glu Thr Pro lie 
2565 2570 2575 

Pro Thr Thr lie Met Ala Lys Asn Glu Val Phe Cys Val Asp Pro Thr 
2580 2585 2590 

Lys Gly Gly Lys Lys Ala Ala Arg Leu lie Val Tyr Pro Asp Leu Gly 
2595 2600 2605 

Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp lie Thr Gin Lys Leu 
2610 2615 2620 

Pro Gin Ala Val Met Gly Ala Ser Tyr Gly Phe Gin Tyr Ser Pro Ala 
2625 2630 2635 2640 

Gin Arg Val Glu Phe Leu Leu Lys Ala Trp Ala Glu Lys Lys Asp Pro 
2645 2650 2655 

Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val Thr Glu 
2660 2665 2670 

Arg Asp lie Arg Thr Glu Glu Ser lie Tyr Arg Ala Cys Ser Leu Pro 
2675 2680 2685 

Glu Glu Ala His Thr Ala lie His Ser Leu Thr Glu Arg Leu Tyr Val 
2690 2695 2700 

Gly Gly Pro Met Phe Asn Ser Lys Gly Gin Thr Cys Gly Tyr Arg Arg 
2705 2710 2715 2720 

Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Met Gly Asn Thr He Thr 
2725 2730 2735 

Cys Tyr Val Lys Ala Leu Ala Ala Cys Lys Ala Ala Gly He He Ala 
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2740 2745 2750 

Pro Thr Met Leu Val Cys Gly Asp Asp Leu Val Val lie Ser Glu Ser 
2755 2760 2765 

Gin Gly Thr Glu Glu Asp Glu Arg Asn Leu Arg Ala Phe Thr Glu Ala 
2770 2775 2780 

Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Arg Pro Glu Tyr 
2785 2790 2795 2800 

Asp Leu Glu Leu He Thr Ser Cys Ser Ser Asn Val Ser Val Ala Leu 
2805 2810 2815 

Gly Pro Gin Gly Arg Arg Arg Tyr Tyr Leu Thr Arg Asp Pro Thr Thr 
2820 2825 2830 

Pro He Ala Arg Ala Ala Trp Glu Thr Val Arg His Ser Pro Val Asn 
2835 2840 2845 

Ser Trp Leu Gly Asn He He Gin Tyr Ala Pro Thr He Trp Ala Arg 
2850 2855 2860 

Met Val Leu Met Thr His Phe Phe Ser He Leu Met Ala Gin Asp Thr 
2865 2870 2875 2880 

Leu Asp Gin Asn Leu Asn Phe Glu Met Tyr Gly Ala Val Tyr Ser Val 
2885 2890 2895 

Ser Pro Leu Asp Leu Pro Ala He He Glu Arg Leu His Gly Leu Asp 
2900 2905 2910 

Ala Phe Ser Leu His Thr Tyr Thr Pro His Glu Leu Thr Arg Val Ala 
2915 2920 2925 

Ser Ala Leu Arg Lys Leu Gly Ala Pro Pro Leu Arg Ala Trp Lys Ser 
2930 2935 2940 

Arg Ala Arg Ala Val Arg Ala Ser Leu He Ser Arg Gly Gly Arg Ala 
2945 2950 2955 2960 

Ala Val Cys Gly Arg Tyr Leu Phe Asn Trp Ala Val Lys Thr Lys Leu 
2965 2970 2975 

Lys Leu Thr Pro Leu Pro Glu Ala Arg Leu Leu Asp Leu Ser Ser Trp 
2980 2985 2990 

Phe Thr Val Gly Ala Gly Gly Gly Asp He Tyr His Ser Val Ser Arg 
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2995 3000 3005 

Ala Arg Pro Arg Leu Leu Leu Phe Gly Leu Leu Leu Leu Phe Val Gly 
3010 3015 3020 

Val Gly Leu Phe Leu Leu Pro Ala Arg 
3025 3030 
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